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TRANSCRIPTIONAL ACTIVATOR NUCLEIC ACIDS, POLYPEPTIDES 
AND METHODS OF USE THEREOF 



TECHNICAL FIELD 

5 The present invention relates generally to plant molecular biology. More 

specifically, it relates to nucleic acids and methods for modulating their 
expression in plants. 

BACKGROUND OF THE INVENTION 

1 0 Major advances in plant transformation have occurred over the last few 

years. However, in major crop plants, such as maize and soybeans, serious 
genotype limitations still exist. Transformation of agronomically important maize 
inbred lines continues to be both difficult and time consuming. Traditionally, the 
only way to elicit a culture response has been by optimizing medium components 

15 and/or explant material and source. This has led to success in some genotypes, 
but most elite hybrids fail to produce a favorable culture response. While, 
transformation of model genotypes is efficient, the process of introgressing 
transgenes into production inbreds is laborious, expensive and time consuming. 
It would save considerable time and money if genes could be introduced into and 

20 evaluated directly in commercial hybrids. 

Current methods for genetic engineering in maize require a specific cell 
type as the recipient of new DNA. These cells are found in relatively 
undifferentiated, rapidly growing callus cells or on the scutellar surface of the 
immature embryo (which gives rise to callus). Irrespective of the delivery method 

25 currently used, DNA is introduced into literally thousands of cells, yet 
transformants are recovered at frequencies of 10' 5 relative to transiently- 
expressing cells. Exacerbating this problem, the trauma that accompanies DNA 
introduction directs recipient cells into cell cycle arrest and accumulating 
evidence suggests that many of these cells are directed into apoptosis or 

30 programmed cell death. (Reference Bowen et al, Third International Congress of 
the International Society for Plant Molecular Biology, 1991, Abstract 1093). 
Therefore it would be desirable to provide improved methods capable of 
increasing transformation efficiency in a number of cell types. 
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Typically a selectable marker is used to recover transformed cells. 
Traditional selection schemes expose all cells to a phytotoxic agent and rely on 
the introduction of a resistance gene to recover transformants. Unfortunately, the 
presence) of dying cells may reduce the efficiency of stable transformation. It 
5 would therefore be useful to provide a positive selection system for recovering 
transformants. 

In spite of increases in yield and harvested area worldwide, it is predicted 
that over the next ten years, meeting the demand for corn will require an 
additional 20% increase over current production (Dowswell, C.R., Paliwal, R.L., 

1 0 Cantrell, R.P., 1 996, Maize in the Third World, Westview Press, Boulder, CO). 

In hybrid crops, including grains, oil seeds, forages, fruits and vegetables, 
there are problems associated with the development and production of hybrid 
seeds. The process of cross-pollination of plants is laborious and expensive. In 
the cross-pollination process, the female plant must be prevented from being 

1 5 fertilized by its own pollen. Many methods have been developed over the years, 
such as detasseling in the case of corn, developing and maintaining male sterile 
lines, and developing plants that are incompatible with their own pollen, to name 
a few. Since hybrids do not breed true, the process must be repeated for the 
production of every hybrid seed lot. 

20 To further complicate the process, inbred lines are crossed. For example 

in the case of corn, the inbreds can be low yielding. This provides a major 
challenge in the production of hybrid seed corn. In fact, certain hybrids cannot be 
commercialized at all due to the performance of the inbred lines. The production 
of hybrid seeds is consequently expensive, time consuming and provides known 

25 and unknown risks. It would therefore be valuable to develop new methods which 
contribute to the increase of production efficiency of hybrid seed. 

As new traits are added to commercial crops by means of genetic 
engineering, problems arise in "stacking" traits. In order to develop heritable 
stacked traits, the traits must be linked because of segregating populations. 

30 Improved methods for developing hybrid seed which would not require linking of 
the traits would significantly shorten the time for developing commercial hybrid 
seeds. 
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Gene silencing is another problem in developing heritable traits with 
genetic engineering. Frequently gene silencing is seen following meiotic 
divisions. Elimination or reduction of this problem would advance the state of 
science and industry in this area. 

5 

SUMMARY OF THE INVENTION 

It is the object of the present invention to provide nucleic acids and 
polypeptides relating to embryogenesis. 

It is another object of the present invention to provide nucleic acids and 
10 polypeptides that can be used to identify interacting proteins involved in 
transcription regulation in embryogenesis. 

It is another object of the present invention to provide antigenic fragments 
of the polypeptides of the present invention. 

It is another object of the present invention to provide transgenic plants 
1 5 and plant parts containing the nucleic acids of the present invention. 

It is another object of the present invention to provide methods for 
modulating, in a transgenic plant, the expression of the nucleic acids of the 
present invention. 

It is another object of the present invention to provide a method for 
20 improving transformation frequencies. 

It is another object of the present invention to provide a method for 
improving transformation efficiency in cells from various sources. 

It is another object of the present invention to provide a method for a 
positive selection system. 
25 It is another object of the present invention to provide a method for 

efficiently producing hybrid seed via apomixis. 

It is another object of the present invention to provide a method for 
stacking traits which does not require linking of traits. 

It is another object of the present invention to provide a method for 
30 reducing the problem of gene silencing. 

The present invention relates to a HAP3-type CCAAT-box binding 
transcriptional activator polynucleotides and polypeptides, and in particular, the 
leafy cotyledon 1 transcriptional activator (LEC1) polynucleotides and 
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polypeptides. In other aspects the present invention relates to expression 
cassettes optionally linked in antisense orientation, host cells transfected with at 
least one expression cassette, and transgenic plants and seeds comprising the 
expression cassettes. Further aspects of the invention include methods of using 
5 the polynucleotides and polypeptides. In a further aspect, the present invention 
relates to a method of modulating expression of the polynucleotides encoding the 
polypeptides of the present invention in a plant. Expression of the 
polynucleotides encoding the proteins of the present invention can be increased 
or decreased relative to a non-transformed control plant. 

10 

BRIEF DESCRIPTION OF THE DRAWING 

Figure 1 depicts the comparison of various sequences and the alignment of 
the conserved regions. 

15 DETAILED DESCRIPTION OF THE INVENTION 

DEFINITIONS 

The term "isolated" refers to material, such as a nucleic acid or a protein, 

which is: (1) substantially or essentially free from components which normally 

accompany or interact with the material as found in its naturally occurring 
20 environment or (2) if the material is in its natural environment, the material has 

been altered by deliberate human intervention to a composition and/or placed at a 

locus in the cell other than the locus native to the material. 

As used herein, "nucleic acid" means a polynucleotide and includes single 

or double-stranded polymer of deoxyribonucleotide or ribonucleotide bases. 
25 Nucleic acids may also include modified nucleotides that permit correct read 

through by a polymerase and do not alter the expression of a polypeptide 

encoded by the polynucleotide. 

As used herein, "LEC1 nucleic acid" means a nucleic acid or 

polynucleotide that codes for a LEC1 polypeptide. 
30 As used herein, "polypeptide" means proteins, protein fragments, modified 

proteins, amino acid sequences and synthetic amino acid sequences. The 

polypeptide can be glycosylated or not. 
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As used herein, "LEC1 polypeptide" means a HAP3 family member, 
CCAAT-box binding transcriptional activator polypeptide that regulates gene 
expression during embryo development, and that contains the conserved 
sequence set out in SEQ ID NO: 23. 
5 As used herein, "plant" includes plants and plant parts including but not 

limited to plant cells, plant tissue such as leaves, stems, roots, flowers, and 
seeds. 

As used herein, "promoter" includes reference to a region of DNA upstream 
from the start of transcription and involved in recognition and binding of RNA 

1 0 polymerase and other proteins to initiate transcription. 

By "fragment" is intended a portion of the nucleotide sequence or a portion 
of the amino acid sequence and hence protein encoded thereby. Fragments of a 
nucleotide sequence may encode protein fragments that retain the biological 
activity of the native nucleic acid. Alternatively, fragments of a nucleotide 

1 5 sequence that are useful as hybridization probes generally do not encode 
fragment proteins retaining biological activity. Thus, fragments of a nucleotide 
sequence are generally greater than 20, 30, 50, 100, 150, 200 or 300 nucleotides 
and up to the entire nucleotide sequence encoding the proteins of the invention. 
Generally the probes are less than 1000 nucleotides and preferably less than 500 

20 nucleotides. Fragments of the invention include antisense sequences used to 
decrease expression of the inventive polynucleotides. Such antisense fragments 
may vary in length ranging from at least about 20 nucleotides, about 50 
nucleotides, about 100 nucleotides, up to and including the entire coding 
sequence. 

25 By "functional equivalenf as applied to a polynucleotide or a protein is 

intended a polynucleotide or a protein of sufficient length to modulate the level of 
LEC1 protein activity in a plant cell. A polynucleotide functional equivalent can 
be in sense or antisense orientation. 

By "variants" is intended substantially similar sequences. Generally, 

30 nucleic acid sequence variants of the invention will have at least 60%, 65%, or 
70%, preferably 75%, 80% or 90%, more preferably at least 95% and most 
preferably at least 98% sequence identity to the native nucleotide sequence, 
wherein the % sequence identity is based on the entire sequence and is 



WO 00/28058 PCT/US99/26S1 4 

-6- 

determined by GAP analysis using Gap Weight of 50 and Length Weight of 3. 
Generally, polypeptide sequence variants of the invention will have at least about 
50%, 55%, 60%, 65%, 70%, 75% or 80%, preferably at least about 85% or 90%, 
and more preferably at least about 95% sequence identity to the native protein, 

5 wherein the % sequence identity is based on the entire sequence and is 
determined by GAP analysis using Gap Weight of 12 and Length Weight of 4. 

A "responsive plant cell" or "responsive host cell" refers to a cell that 
exhibits a positive response to the introduction of LEC1 polypeptide or LEC1 
polynucleotide compared to a cell that has not been introduced with LEC1 

10 polypeptide or LEC1 polynucleotide. The response can be to enhance tissue 
culture response, induce somatic embryogenesis, induce apomlxis, increase 
transformation efficiency or increase recovery of regenerated plants. 

A "recalcitrant plant cell" is a responsive plant cell that generally does not 
exhibit a positive response such as tissue culture response, transformation 

1 5 efficiency or recovery of regenerated plants. 



NUCLEIC ACIDS 

The present invention relates to a HAP3-type CCAAT-box binding 
transcriptional activators, and in particular, the leafy cotyledon 1 transcriptional 

20 activator (LEC1 ). Expression of the LEC1 polynucleotide initiates formation of 
embryo-like structures and improves growth and recovery of transformants. The 
term apomixis is used to describe asexual reproduction that replaces or 
substitutes sexual methods of reproduction. When apomixis occurs, embryos are 
produced from maternal tissue and use only the maternal genome. 

25 In particular the present invention relates to an isolated nucleic acid 

comprising a member selected from the group consisting of: 

(a) a polynucleotide which encodes a polypeptide of SEQ ID NO: 2, 8, 
10, 12, 14, 16, 18, 20, or 22; 

(b) a polynucleotide amplified from a plant nucleic acid library using the 
30 primers of SEQ ID NOS: 3 and 4, 5 and 6, 9 and 1 0, or 1 1 and 12 or 

primers determined by using Vector nti Suite, InforMax Version 5. 

(c) a polynucleotide comprising at least 20 contiguous bases of SEQ ID 
NO: 1,7, 9,11,13, 15, 17,19,or21; 
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(d) a polynucleotide encoding a plant HAP3-type ccaat-box transcriptional 
activator with the conserved motif of SEQ ID NO: 23, wherein the 
polynucleotide is from a plant other than Arabidopsis; 

(e) a polynucleotide having at least 60% sequence identity to SEQ ID NO: 
5 1, 9, 11, 13, 17, or 21 or65% sequence identity to SEQ ID NO: 15 or 

19 or 70% sequence identity to SEQ ID NO: 7, wherein the % 
sequence identity is based on the entire sequence and is determined 
by GAP analysis using Gap Weight of 50 and Length Weight of 3; 

(f) a polynucleotide comprising at least 25 nucleotides in length which 
1 o hybridizes under high stringency conditions to a polynucleotide 

having the sequence set forth in SEQ ID NO: 1, 7, 9, 11, 13, 15, 17, 
19, or 21; 

(g) a polynucleotide encoding the protein of SEQ ID NO: 2, 8, 1 0, 12, 
14, 16, 18, 20, or 22, wherein the polynucleotide is from a plant 

15 other than Arabidopsis; 

(h) a polynucleotide having the sequence set forth in SEQ ID NO: 1 , 7, 9, 
11,13,15, 17, 19, or 21; and 

(i) a polynucleotide complementary to a polynucleotide of (a) through 

(h). 

20 In many cases of apomixis maternal tissues such as the nucellus or inner 

integument "bud off" producing somatic embryos. These embryos then develop 
normally into seed. Since meiosis and fertilization are circumvented, the plants 
developing from such seed are genetically identical to the maternal plant. 
Expression of the leafy cotyledon 1 gene in the nucellus integument, or cell 

25 specific expression in the megaspore mother cell would trigger embryo formation 
from maternal tissues. 

Producing a seed identical to the parent has many advantages. For 
example high yielding hybrids could be used in seed production to multiply 
identical copies of high yielding hybrid seed. This would greatly reduce seed cost 

30 as well as increase the number of genotypes which are commercially available. 
Genes can be evaluated directly in commercial hybrids since the progeny would 
not segregate. This would save years of back crossing. 
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Apomixis would also provide a method of containment of transgenes when 
coupled with male sterility. The construction of male sterile autonomous 
agamospermy would prevent genetically engineered traits from hybridizing with 
weedy relatives. 

5 Gene stacking would be relatively easy with apomixis. Hybrids could be 

successively re-transformed with various new traits and propagated via apomixis. 
The traits would not need to be linked since apomixis avoids the problems 
associated with segregation. 

Apomixis can provide a reduction in gene silencing. Gene silencing is 

1 0 frequently seen following meiotic divisions. Since meiotic divisions never occur, it 
may be possible to eliminate or reduce the frequency of gene silencing. Apomixis 
can also be used stabilize desirable phenotypes with complex traits such as 
hybrid vigor. Such traits could easily be maintained and multiplied indefinitely via 
apomixis. 

1 5 The Cauliflower Mosaic Vims 35S promoter has been used to overexpress 

LEC1 during Agrooacferii/m-mediated in planta transformation of Arabidopsis 
(Harada et a/., WO 98/37184). As pointed out by Harada etal., 35S is a strong 
promoter, and in their experiments it was found that 35S:LEC1 did not improve 
transformation and actually appeared to hinder it (transformation efficiency with 

20 35S:LEC1 was estimated to be 0.6% of that obtained normally). Thus, 
overexpression in a cell type such as those in the gametophytic stage of 
development may be inappropriate and detrimental to the transformation process 
and successful recovery of transformed progeny. In contrast, we have shown that 
ectopic expression of the LEC1 gene under the appropriate control elements 

25 (including tissue specific and/or inducible promoters) and in the appropriate plant 
cells can be used to stimulate embryo formation in tissues/genotypes normally not 
amenable to culture. Likewise ectopic expression in genotypes amenable to 
culture can increase the number of embryo precursor cells (or increase the 
number that develop into embryos) leading to an increase in transformation 

30 frequency. Transient expression using RNA or protein may be sufficient to initiate 
the cascade of events leading to embryo formation. This would be valuable in 
such target tissues as maize scutella, immature leaf bases, immature tassels, etc. 
The LEC1 gene could be used as a positive selectable marker, i.e. triggering 
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embryogenesis in transgenic cells without killing the surrounding wild-type cells. 
This would happen since the cells receiving the introduced gene would undergo 
embryogenesis or in tissues already undergoing embryogenesis LEC1 expression 
would stimulate more rapid reiteration of somatic embryos. 

5 It has been shown through sequence similarity that the Arabidopsis LEC1 

polypeptide is homologous to the HAP3 subunit of the "CCAAT-box binding 
factor" class of eukaryotic transcriptional activators (Lotan etal., 1998, Cell 
93:1 1 95-1205). This class of proteins, which consist of Hap2/3/4 and 5, form a 
heteroligomeric transcriptional complex, that appears to activate specific gene 

1 0 sets in eukaryotes. Certain members of this family such as Hap2 and Hap5 
appear to be ubiquitously expressed, while different Hap3 members are under 
developmental or environmental regulation. Plant HAP3 polypeptides can be 
recognized by a high degree of sequence identity to other HAP3 homologs in the 
"B domain" of the protein. For example, the B domain for the Arabidopsis LEC1 , 

1 5 from amino acid residue 28 to residue 1 1 7, shares between 55% and 63% identity 
(75-85% similarity) to other members of the HAP3 family, including maize (HAP3), 
chicken, lamprey, Xenopus, human, mouse, Emericella nidulens, 
Schizosaccharomyces pombe, Saccharomyces cerevisiae and Kluuyveromyces 
lactis (Lotan et ai, 1998). 

20 Expression of the LEC1 gene in transformed cells initiates embryo 

development and stimulates development of pre-existing embryos. Normally, 
LEC1 expression is necessary for proper embryo maturation in the latter stages of 
embryo development, and LEC1 transgene expression thus may also promote 
these processes. The combined effect of these impacts on somatic 

25 embryogenesis is not only to stimulate growth of transformed cells, but also to 
insure that transformed somatic embryos develop in a normal, viable fashion 
(increasing the capacity of transformed somatic embryos to germinate vigorously). 
Continued ectopic overexpression beyond embryo maturation may negatively 
impact germination and vegetative plant growth (which may necessitate down- 

30 regulation of the LEC1 transgene during these stages of development. 

Expression of the LEC1 gene will stimulate growth in cells with the 
potential to initiate or maintain embryogenic growth. Cells in established 
meristems or meristem-derive cell lineages may be less prone to undergo the 
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transition to embryos. In addition, transformation methods that target certain 
reproductive tissues (or cells) such as vacuum-infiltration of Agrobacterium into 
Arabidopsis may have detrimental effects on recovery of transformants (triggering 
genes associated with embryogenesis may disrupt the proper functioning of these 
5 cells). 

The polypeptides encoded by the present plant LEC1 genes can be 
distinguished from non-LEC HAP3 proteins by using the diagnostic motif shown in 
SEQ ID NO: 23. 

The isolated nucleic acids of the present invention can be made using (a) 

1 0 standard recombinant methods, (b) synthetic techniques, or combinations thereof . 
In some embodiments, the polynucleotides of the present invention will be cloned, 
amplified, or otherwise constructed from a monocot or dicot. In preferred 
embodiments the monocot is corn, sorghum, barley, wheat, millet, or rice. 
Preferred dicots include soybeans, sunflower, canola, alfalfa, potato, or cassava. 

1 5 Functional fragments included in the invention can be obtained using 

primers which selectively hybridize under stringent conditions. Primers are 
generally at least 12 bases in length and can be as high as 200 bases, but will 
generally be from 15 to 75, preferably from 15 to 50. Functional fragments can be 
identified using a variety of techniques such as restriction analysis, Southern 

20 analysis, primer extension analysis, and DNA sequence analysis. 

The present invention includes a plurality of polynucleotides that encode 
for the identical amino acid sequence. The degeneracy of the genetic code 
allows for such "silent variations" which can be used, for example, to selectively 
hybridize and detect allelic variants of polynucleotides of the present invention. 

25 Additionally, the present invention includes isolated nucleic acids comprising 

allelic variants. The term "allele" as used herein refers to a related nucleic acid of 
the same gene. 

Variants of nucleic acids included in the invention can be obtained, for 
example, by oligonucleotide-directed mutagenesis, linker-scanning mutagenesis, 
30 mutagenesis using the polymerase chain reaction, and the like. See, for example, 
Ausubel, pages 8.0.3 - 8.5.9. Also, see generally, McPherson (ed.), DIRECTED 
MUTAGENESIS: A Practical Approach, (IRL Press, 1991). Thus, the present 
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invention also encompasses DNA molecules comprising nucleotide sequences 
that have substantial sequence similarity with the inventive sequences. 

Variants included in the invention may contain individual substitutions, 
deletions or additions to the nucleic acid or polypeptide sequences which alters, 

5 adds or deletes a single amino acid or a small percentage of amino acids in the 
encoded sequence is a "conservatively modified variant" where the alteration 
results in the substitution of an amino acid with a chemically similar amino acid. 
When the nucleic acid is prepared or altered synthetically, advantage can be 
taken of known codon preferences of the intended host. 

1 o The present invention also includes "shufflents" produced by sequence 

shuffling of the inventive polynucleotides to obtain a desired characteristic. 
Sequence shuffling is described in PCT publication No. 96/19256. See also, 
Zhang, J. H., etal., Proc. Natl. Acad. Sci. USA 94:4504-4509 (1997). 

The present invention also includes the use of 5' and/or 3' UTR regions for 

1 5 modulation of translation of heterologous coding sequences. Positive sequence 
motifs include translational initiation consensus sequences (Kozak, Nucleic Acids 
Res. 15:81 25 (1987)) and the 7-methylguanosine cap structure (Drummond et ai, 
Nucleic Acids Res. 1 3:7375 (1 985)). Negative elements include stable 
intramolecular 5" UTR stem-loop structures (Muesing etal., Cell 48:691 (1987)) 

20 and AUG sequences or short open reading frames preceded by an appropriate 
AUG in the 5' UTR (Kozak, supra, Rao et ai, Mol. and Cell. Biol. 8:284 (1988)). 

Further, the polypeptide-encoding segments of the polynucleotides of the 
present invention can be modified to alter codon usage. Altered codon usage can 
be employed to alter translational efficiency. Codon usage in the coding regions 

25 of the polynucleotides of the present invention can be analyzed statistically using 
commercially available software packages such as "Codon Preference" available 
from the University of Wisconsin Genetics Computer Group (see Devereaux et ai, 
Nucleic Acids Res. 12:387-395 (1 984)) or MacVector 4.1 (Eastman Kodak Co., 
New Haven, Conn.). 

30 For example, the inventive nucleic acids can be optimized for enhanced 

expression in plants of interest. See, for example, EPA0359472; W091/16432; 
Perlak et ai (1991) Proc. Natl. Acad. Sci. USA 88:3324-3328; and Murray et ai 
(1989) Nucleic Acids Res. 77:477-498 In this manner, the polynucleotides can 
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be synthesized utilizing plant-preferred codons. See, for example, Murray et al. 
(1989) Nucleic Acids Res. 1 7:477-498. the disclosure of which is incorporated 
herein by reference. 

The present invention provides subsequences comprising isolated nucleic 
acids containing at least 16 contiguous bases of the inventive sequences. For 
example the isolated nucleic acid includes those comprising at least 16, 20, 25, 
30, 40, 50, 60, 75 or 100 contiguous nucleotides of the inventive sequences. 
Subsequences of the isolated nucleic acid can be used to modulate or detect 
gene expression by introducing into the subsequences compounds which bind, 
intercalate, cleave and/or crosslink to nucleic acids. 

The nucleic acids of the invention may conveniently comprise a multi- 
cloning site comprising one or more endonuclease restriction sites inserted into 
the nucleic acid to aid in isolation of the polynucleotide. Also, translatable 
sequences may be inserted to aid in the isolation of the translated polynucleotide 
of the present invention. For example, a hexa-histidine marker sequence 
provides a convenient means to purify the proteins of the present invention. 

A polynucleotide of the present invention can be attached to a vector, 
adapter, promoter, transit peptide or linker for cloning and/or expression of a 
polynucleotide of the present invention. Additional sequences may be added to 
such cloning and/or expression sequences to optimize their function in cloning 
and/or expression, to aid in isolation of the polynucleotide, or to improve the 
introduction of the polynucleotide into a cell. Use of cloning vectors, expression 
vectors, adapters, and linkers is well known and extensively described in the art. 
For a description of such nucleic acids see, for example, Stratagene Cloning 
Systems, Catalogs 1995, 1996, 1997 (La Jolla, CA); and, Amersham Life 
Sciences, Inc. Catalog '97 (Arlington Heights, IL). 

The isolated nucleic acid compositions of this invention, such as RNA, 
cDNA genomic DNA, or a hybrid thereof, can be obtained from plant biological 
sources using any number of cloning methodologies known to those of skill in the 
art. In some embodiments, oligonucleotide probes which selectively hybridize, 
under stringent conditions, to the polynucleotides of the present invention are 
used to identify the desired sequence in a cDNA or genomic DNA library. 
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Exemplary total RNA and mRNA isolation protocols are described in Plant 
Molecular Biology: A Laboratory Manual, Clark, Ed., Springer-Veriag, Berlin 
(1997); and, Current Protocols in Molecular Biology, Ausubel, era/., Eds., Greene 
Publishing and Wiley-lnterscience, New York (1995). Total RNA and mRNA 

5 isolation kits are commercially available from vendors such as Stratagene (La 
Jolla, CA), Clonetech (Palo Alto, CA), Pharmacia (Piscataway, NJ), and 5*-3' 
(Paoli, PA). See also, U.S. Patent Nos. 5,614,391; and, 5,459,253. 

Typical cDNA synthesis protocols are well known to the skilled artisan and 
are described in such standard references as: Plant Molecular Biology: A 

1 0 Laboratory Manual, Clark, Ed., Springer-Veriag, Berlin (1 997); and, Current 
Protocols in Molecular Biology, Ausubel, etal., Eds., Greene Publishing and 
Wiley-lnterscience, New York (1995). cDNA synthesis kits are available from a 
variety of commercial vendors such as Stratagene or Pharmacia. 

An exemplary method of constructing a greater than 95% pure full-length 

15 cDNA library is described by Caminci et a/., Genomics, 37:327-336 (1 996). Other 
methods for producing full-length libraries are known in the art. See, e.g., Edery 
etal., Mol. Cell Biol. 15(6):3363-3371 (1995); and PCT Application WO 96/34981. 

It is often convenient to normalize a cDNA library to create a library in 
which each clone is more equally represented. A number of approaches to 

20 normalize cDNA libraries are known in the art. Construction of normalized 

libraries is described in Ko, Nucl. Acids. Res. 18(19):5705-5711 (1990); Patanjali 
etal., Proc. Natl. Acad. U.S.A. 88:1943-1947 (1991); U.S. Patents 5,482,685 and 
5,637,685; and Soares etal., Proc. Natl. Acad. Sci. USA 91:9228-9232 (1994). 
Subtracted cDNA libraries are another means to increase the proportion of 

25 less abundant cDNA species. See, Foote et al. in, Plant Molecular Biology: A 
Laboratory Manual, Clark, Ed., Springer-Veriag, Berlin (1997); Kho and Zarbl, 
Technique 3(2):58-63 (1991); Sive and St. John, Nucl. Acids Res. 16(22): 10937 
(1988); Current Protocols in Molecular Biology, Ausubel, etal., Eds., Greene 
Publishing and Wiley-lnterscience, New York (1995); and, Swaroop et al., Nucl. 

30 Acids Res. 19(8):1954 (1991 ). cDNA subtraction kits are commercially available. 
See, e.g., PCR-Select (Clontech). 

To construct genomic libraries, large segments of genomic DNA are 
generated by random fragmentation. Examples of appropriate molecular 
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biological techniques and instructions are found in Sambrook, et al., Molecular 
Cloning: A Laboratory Manual, 2nd Ed., Cold Spring Harbor Laboratory. Vols. 1-3 
(1989), Methods in Enzymology, Vol. 152: Guide to Molecular Cloning 
Techniques, Berger and Wmmel, Eds., San Diego: Academic Press, Inc. (1987), 

5 Current Protocols in Molecular Biology, Ausubel, et al., Eds., Greene Publishing 
and Wiley-lnterscience, New York (1995); Plant Molecular Biology: A Laboratory 
Manual, Clark, Ed., Springer-Verlag, Berlin (1997). Kits for construction of 
genomic libraries are also commercially available. 

The cDNA or genomic library can be screened using a probe based upon 

1 0 the sequence of a nucleic acid of the present invention such as those disclosed 
herein. Probes may be used to hybridize with genomic DNA or cDNA sequences 
to isolate homologous polynucleotides in the same or different plant species. 
Those of skill in the art will appreciate that various degrees of stringency of 
hybridization can be employed in the assay; and either the hybridization or the 

15 wash medium can be stringent. The degree of stringency can be controlled by 
temperature, ionic strength, pH and the presence of a partially denaturing solvent 
such as formamide. 

Typically, stringent hybridization conditions will be those in which the salt 
concentration is less than about 1.5 M Na ion, typically about 0.01 to 1.0 M Na ion 

20 concentration (or other salts) at pH 7.0 to 8.3 and the temperature is at least 
about 30°C for short probes (e.g., 10 to 50 nucleotides) and at least about 60°C 
for long probes (e.g., greater than 50 nucleotides). Stringent conditions may also 
be achieved with the addition of destabilizing agents such as formamide. 
For purposes of defining the invention the following conditions are 

25 provided. Exemplary low stringency conditions include hybridization with a buffer 
solution of 30 to 35% formamide, 1 M NaCI, 1% SDS (sodium dodecyl sulfate) at 
37°C, and a wash in 1X to 2X SSC (20X SSC = 3.0 M NaCI/0.3 M trisodium 
citrate) at 50°C. Exemplary moderate stringency conditions include hybridization 
in 40 to 45% formamide, 1 M NaCI, 1 % SDS at 37°C, and a wash in 0.5X to 1X 

30 SSC at 55°C. Exemplary high stringency conditions include hybridization in 50% 
formamide, 1 M NaCI, 1% SDS at 37°C, and a wash in 0.1 X SSC at 60°C. 
Typically the time of hybridization is from 4 to 16 hours. 
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An extensive guide to the hybridization of nucleic acids is found in Tijssen, 
Laboratory Techniques in Biochemistry and Molecular Biology-Hybridization with 
Nucleic Acid Probes, Part I, Chapter 2 "Overview of principles of hybridization and 
the strategy of nucleic acid probe assays", Elsevier, New York (1 993); and 
Current Protocols in Molecular Biology, Chapter 2, Ausubel, etai, Eds., Greene 
Publishing and Wiley-lnterscience, New York (1 995). Often, cDNA libraries will 
be normalized to increase the representation of relatively rare cDNAs. 

The nucleic acids of the invention can be amplified from nucleic acid 
samples using amplification techniques. For instance, polymerase chain reaction 
(PCR) technology can be used to amplify the sequences of polynucleotides of the 
present invention and related polynucleotides directly from genomic DNA or 
cDNA libraries. PCR and other in vitro amplification methods may also be useful, 
for example, to clone nucleic acid sequences that code for proteins to be 
expressed, to make nucleic acids to use as probes for detecting the presence of 
the desired mRNA in samples, for nucleic acid sequencing, or for other purposes. 

Examples of techniques useful for in vitro amplification methods are found 
in Berger, Sambrook, and Ausubel, as well as Mullis etai, U.S. Patent No. 
4,683,202 (1987); and, PCR Protocols A Guide to Methods and Applications, Innis 
etai., Eds., Academic Press Inc., San Diego, CA (1990). Commercially available 
kits for genomic PCR amplification are known in the art. See, e.g., Advantage-GC 
Genomic PCR Kit (Clontech). The T4 gene 32 protein (Boehringer Mannheim) 
can be used to improve yield of long PCR products. PCR-based screening 
methods have also been described. Wilfinger et al. describe a PCR-based 
method in which the longest cDNA is identified in the first step so that incomplete 
clones can be eliminated from study. BioTechniques, 22(3):481-486 (1997). 

In one aspect of the invention, nucleic acids can be amplified from a plant 
nucleic acid library. The nucleic acid library may be a cDNA library, a genomic 
library, or a library generally constructed from nuclear transcripts at any stage of 
intron processing. Libraries can be made from a variety of plant tissues. Good 
results have been obtained using mitotically active tissues such as shoot 
meristems, shoot meristem cultures, embryos, callus and suspension cultures, 
immature ears and tassels, and young seedlings. The cDNAs of the present 
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invention were obtained from immature zygotic embryo and regenerating callus 
libraries. 

Alternatively, the sequences of the invention can be used to isolate 
corresponding sequences in other organisms, particularly other plants, more 
5 particularly, other monocots. In this manner, methods such as PCR, 
hybridization, and the like can be used to identify such sequences having 
substantial sequence similarity to the sequences of the invention. See, for 
example, Sambrook ef a/. (1989) Molecular Cloning: A Laboratory Manual (2d ed., 
Cold Spring Harbor Laboratory Press, Plainview, New York), and Innis et al. 

10 (1 990), PCR Protocols: A Guide to Methods and Applications (Academic Press, 
New York). Coding sequences isolated based on their sequence identity to the 
entire inventive coding sequences set forth herein or to fragments thereof are 
encompassed by the present invention. 

The isolated nucleic acids of the present invention can also be prepared by 

1 5 direct chemical synthesis by methods such as the phosphotriester method of 
Narang et al., Meth. Enzymol. 68:90-99 (1979); the phosphodiester method of 
Brown et al., Meth. Enzymol. 68:109-151 (1979); the diethylphosphoramidite 
method of Beaucage etai, Tetra. Lett. 22:1859-1862 (1981); the solid phase 
phosphoramidite triester method described by Beaucage and Caruthers, Tetra. 

20 Letts. 22(20):1859-1862 (1981), e.g., using an automated synthesizer, e.g., as 
described in Needham-VanDevanter ef al., Nucleic Acids Res. 12:6159-6168 
(1984); and, the solid support method of U.S. Patent No. 4,458,066. Chemical 
synthesis generally produces a single stranded oligonucleotide. This may be 
converted into double stranded DNA by hybridization with a complementary 

25 sequence, or by polymerization with a DNA polymerase using the single strand as 
a template. One of skill will recognize that while chemical synthesis of DNA is 
limited to sequences of about 100 bases, longer sequences may be obtained by 
the ligation of shorter sequences. 

The nucleic acids of the present invention include those amplified using the 

30 following primer pairs: SEQ ID NOS: 3 and 4, 5 and 6, 9 and 10, or 1 1 and 12 or 
primers determined by using Vector nti Suite, InforMax Version 5. 
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EXPRESSION CASSETTES 

In another embodiment expression cassettes comprising isolated nucleic 
acids of the present invention are provided. An expression cassette will typically 
comprise a polynucleotide of the present invention operably linked to 
5 transcriptional initiation regulatory sequences which will direct the transcription of 
the polynucleotide in the intended host cell, such as tissues of a transformed 
plant 

The construction of such expression cassettes which can be employed in 
conjunction with the present invention is well known to those of skill in the art in 

10 light of the present disclosure. See, e.g., Sambrook, et a/.; Molecular Cloning: A 
Laboratory Manual; Cold Spring Harbor, New York; (1989); Gelvin, era/.; Plant 
Molecular Biology Manual (1990); Plant Biotechnology: Commercial Prospects 
and Problems, eds. Prakash, et a/.; Oxford & IBH Publishing Co.; New Delhi, 
India; (1993); and Heslot, et at, Molecular Biology and Genetic Engineering of 

15 Yeasts; CRC Press, Inc., USA; (1992); each incorporated herein in its entirety by 
reference. 

For example, plant expression vectors may include (1) a cloned plant gene 
under the transcriptional control of 5' and 3' regulatory sequences and (2) a 
dominant selectable marker. Such plant expression vectors may also contain, if 
20 desired, a promoter regulatory region (e.g., one conferring inducible, constitutive, 
environmentally- or developmentally-regulated, or cell- or tissue-specific/selective 
expression), a transcription initiation start site, a ribosome binding site, an RNA 
processing signal, a transcription termination site, and/or a polyadenylation 
signal. 

25 Constitutive, tissue-preferred or inducible promoters can be employed. 

Examples of constitutive promoters include the cauliflower mosaic virus (CaMV) 
35S transcription initiation region, the 1'- or 2'- promoter derived from T-DNA of 
Agrobacterium tumefaciens, the actin promoter, the ubiquitin promoter, the 
histone H2B promoter (Nakayama et a/., 1992, FEBS Lett 30:167-170), the Smas 

30 promoter, the cinnamyl alcohol dehydrogenase promoter (U.S. Patent No. 
5,683,439), the Nos promoter, the pEmu promoter, the rubisco promoter, the 
GRP1-8 promoter, and other transcription initiation regions from various plant 
genes known in the art. 
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Examples of inducible promoters are the Adh1 promoter which is inducible 
by hypoxia or cold stress, the Hsp70 promoter which is inducible by heat stress, 
the PPDK promoter which is inducible by light, the In2 promoter which is safener 
induced, the ERE promoter which is estrogen induced and the Pepcarboxylase 
promoter which is light induced. 

Examples of promoters under developmental control include promoters that 
initiate transcription preferentially in certain tissues, such as leaves, roots, fruit, 
seeds, or flowers. An exemplary promoter is the anther specific promoter 5126 
(U.S. Patent Nos. 5,689,049 and 5,689,051). Examples of seed-preferred 
promoters include, but are not limited to, 27 kD gamma zein promoter and waxy 
promoter, Boronat, A., Martinez, M.C., Reina, M., Puigdomenech, P. and Palau, 
J.; Isolation and sequencing of a 28 kD glutelin-2 gene from maize: Common 
elements in the 5' flanking regions among zein and glutelin genes; Plant Sci. 
47:95-102 (1986) and Reina, M., Ponte, I., Guillen, P., Boronat, A. and Palau, J., 
Sequence analysis of a genomic clone encoding a Zc2 protein from Zea mays 
W64 A, Nucleic Acids Res. 1 8(21 ):6426 (1 990). See the following site relating to 
the waxy promoter: Kloesgen, R.B., Gierl.A, Schwarz-Sommer, Z.S. and Saedler, 
H., Molecular analysis of the waxy locus of Zea mays, Mol. Gen. Genet. 203:237- 
244 (1986). The disclosures each of these are incorporated herein by reference 
in their entirety. 

Preferably a weak constitutive promoter, such as the Nos promoter, an 
inducible promoter, such as In2, or a nucellus-preferred or integument-preferred 
promoter are used to induce apospory. For example the barley or maize Nuc1 
promoter, the maize Cim 1 promoter or the maize LTP2 promoter can be used to 
preferentially express in the nucellus. See for example US Serial No. 60/097,233 
filed August 20, 1998 the disclosure of which is incorporated herein by reference. 

Either heterologous or non-heterologous (i.e., endogenous) promoters can 
be employed to direct expression of the nucleic acids of the present invention. 
These promoters can also be used, for example, in expression cassettes to drive 
expression of antisense nucleic acids to reduce, increase, or alter concentration 
and/or composition of the proteins of the present invention in a desired tissue. 

If polypeptide expression is desired, it is generally desirable to include a 
polyadenylation region at the 3'-end of a polynucleotide coding region. The 



WO 00/28058 PCT/US99/26514 

-19- 

polyadenylation region can be derived from the natural gene, from a variety of 
other plant genes, or from T-DNA. The 3' end sequence to be added can be 
derived from, for example, the nopaline synthase or octopine synthase genes, or 
alternatively from another plant gene, or less preferably from any other eukaryotic 
5 gene. 

An intron sequence can be added to the 5' untranslated region or the 
coding sequence of the partial coding sequence to increase the amount of the 
mature message that accumulates. See for example Buchman and Berg, Mol. 
Cell Biol. 8:4395-4405 (1 988); Callis et a/., Genes Dev. 1:11 83-1 200 (1 987). Use 

1 0 of maize introns Adh1 -S intron 1 , 2, and 6, the Bronze-1 intron are known in the 
art. See generally, The Maize Handbook, Chapter 116, Freeling and Walbot, 
Eds., Springer, New York (1994). 

The vector comprising the sequences from a polynucleotide of the present 
invention will typically comprise a marker gene which confers a selectable 

1 5 phenotype on plant cells. Usually, the selectable marker gene will encode 
antibiotic or herbicide resistance. Suitable genes include those coding for 
resistance to the antibiotics spectinomycin and streptomycin (e.g., the aada 
gene), the streptomycin phosphotransferase (SPT) gene coding for streptomycin 
resistance, the neomycin phosphotransferase (NPTII) gene encoding kanamycin 

20 or geneticin resistance, the hygromycin phosphotransferase (HPT) gene coding 
for hygromycin resistance. 

Suitable genes coding for resistance to herbicides include those which act 
to inhibit the action of acetolactate synthase (ALS), in particular the 
sulfonylurea-type herbicides (e.g., the acetolactate synthase (ALS) gene 

25 containing mutations leading to such resistance in particular the S4 and/or Hra 
mutations), those which act to inhibit action of glutamine synthase, such as 
phosphinothricin or basta (e.g., the bar gene), or other such genes known in the 
art. The bar gene encodes resistance to the herbicide basta and the ALS gene 
encodes resistance to the herbicide chlorsulfuron. While useful in conjunction 

30 with the above antibiotic and herbicide-resistance selective markers (i.e. use of 
the LEC1 gene can increase transformation frequencies when using chemical 
selection), a prefered use of LEC1 expression takes advantage of this gene 
conferring a growth advantage to transformed cells without the need for inhibitory 
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compounds to retard non-transformed growth. Thus, LEC1 transformants are 
recovered based solely on their differential growth advantage. 

Typical vectors useful for expression of genes in higher plants are well 
known in the art and include vectors derived from the tumor-inducing (Ti) plasmid 

5 of Agrobacterium tumefaciens described by Rogers et al. , Meth. In Enzymol. 
153:253-277 (1987). Exemplary A. tumefaciens vectors useful herein are 
plasmids pKYLX6 and pKYLX7 of Schardl etal., Gene, 61:1-11 (1987) and Berger 
et al., Proc. Natl. Acad. Sa USA 86:8402-8406 (1989). Another useful vector 
herein is plasmid pBI101.2 that is available from Clontech Laboratories, Inc. (Palo 

10 Alto, CA). 

A variety of plant viruses that can be employed as vectors are known in the 
art and include cauliflower mosaic virus (CaMV), geminivirus, brome mosaic virus, 
and tobacco mosaic virus. 

A polynucleotide of the present invention can be expressed in either sense 
15 or anti-sense orientation as desired. In plant cells, it has been shown that 

antisense RNA inhibits gene expression by preventing the accumulation of mRNA 
which encodes the enzyme of interest, see, e.g., Sheehy et al., Proc. Natl. Acad. 
Sci. USA 85:8805-8809 (1988); and Hiatt etal., U.S. Patent No. 4,801,340. 
Another method of suppression is sense suppression. Introduction of 
20 nucleic acid configured in the sense orientation has been shown to be an 
effective means by which to block the transcription of target genes. For an 
example of the use of this method to modulate expression of endogenous genes 
see, Napoli et al., The Plant Cell 2:279-289 (1990) and U.S. Patent No. 
5,034,323. 

25 Recent work has shown suppression with the use of double stranded RNA 

Such work is described in Tabara et al., Science 282:5388:430-431 (1998). 

Catalytic RNA molecules or ribozymes can also be used to inhibit 
expression of plant genes. The inclusion of ribozyme sequences within antisense 
RNAs confers RNA-cleaving activity upon them, thereby increasing the activity of 
30 the constructs. The design and use of target RNA-specific ribozymes is 
described in Haseloff et al., Nature 334:585-591 (1988). 

A variety of cross-linking agents, alkylating agents and radical generating 
species as pendant groups on polynucleotides of the present invention can be 
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used to bind, label, detect, and/or cleave nucleic acids. For example, Vlassov, V. 
V., et ai, Nucleic Adds Res (1986) 14:4065-4076, describe covalent bonding of a 
single-stranded DNA fragment with alkylating derivatives of nucleotides 
complementary to target sequences. A report of similar work by the same group 

5 is that by Knorre, D. G., et ai, Biochimie (1985) 67:785-789. Iverson and Dervan 
also showed sequence-specific cleavage of single-stranded DNA mediated by 
incorporation of a modified nucleotide which was capable of activating cleavage 
(J. Am. Chem. Soc. (1987) 109:1241-1243). Meyer, R. B., etai, J. Am. Chem. 
Soc. (1989) 111:8517-8519, effect covalent crosslinking to a target nucleotide 

1 0 using an alkylating agent complementary to the single-stranded target nucleotide 
sequence. A photoactivated crosslinking to single-stranded oligonucleotides 
mediated by psoralen was disclosed by Lee, B. L, et ai, Biochemistry (1988) 
27:3197-3203. Use of crosslinking in triple-helix forming probes was also 
disclosed by Home, et ai, J. Am. Chem. Soc. (1990) 1 12:2435-2437. Use of N4, 

1 5 N4-ethanocytosine as an alkylating agent to crosslink to single-stranded 

oligonucleotides has also been described by Webb and Matteucci, J. Am. Chem. 
Soc. (1986) 108:2764-2765; Nucleic Acids Res ( 1986) 14:7661-7674; Feteritz et 
ai, J. Am. Chem. Soc. 113:4000 (1991). Various compounds to bind, detect, 
label, and/or cleave nucleic acids are known in the art. See, for example, U.S. 

20 Patent Nos. 5,543,507; 5,672,593; 5,484,908; 5,256,648; and, 5,681 941 . 

Proteins 

In another aspect, the invention relates to an isolated protein comprising a 
member selected from the group consisting of: 
25 (a) a polypeptide comprising at least 25 contiguous amino acids of SEQ 

ID NO: 2, 8, 10, 12, 14, 16, 18, 20, or 22; 
(b) a polypeptide which is a plant HAP3-type CCATT-box binding 
transcriptional activator that regulates gene expression during 
embryo development and maturation; 
30 (c) a polypeptide comprising at least 60% sequence identity to SEQ ID 

NO: 2, 12, 14, 16, 20, or 22, or 70 % sequence identity to SEQ ID NO: 
8, 10, or 18, wherein the % sequence identity is based on the entire 
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sequence and is determined by GAP analysis using Gap Weight of 12 
and Length Weight of 4; 

(d) a polypeptide encoded by a nucleic acid of claim 1 ; 

(e) a polypeptide encoded by a nucleic acid of SEQ ID NO: 1,7, 9, 11, 
5 13, 15, 17, 19, or 21; and 

(f) a polypeptide having the sequence set forth in SEQ ID NO: 2, 8, 10, 
12, 14, 16, 18, 20, or 22. 

Proteins of the present invention include proteins derived from the native 

1 0 protein by deletion (so-called truncation), addition or substitution of one or more 
amino acids at one or more sites in the native protein. Such variants may result 
from, for example, genetic polymorphism or from human manipulation. Methods 
for such manipulations are generally known in the art. 

For example, amino acid sequence variants of the polypeptide can be 

1 5 prepared by mutations in the cloned DNA sequence encoding the native protein 
of interest. Methods for mutagenesis and nucleotide sequence alterations are 
well known in the art. See, for example, Walker and Gaastra, eds. (1983) 
Techniques in Molecular Biology (MacMillan Publishing Company, New York); 
Kunkel (1985) Proc. Natl. Acad. Sci. USA 82:488-492; Kunkel etal. (1987) 

20 Methods Enzymol. 1 54:367-382; Sambrook et al. (1 989) Molecular Cloning: A 
Laboratory Manual (Cold Spring Harbor, New York); U.S. Patent No. 4,873,192; 
and the references cited therein; herein incorporated by reference. Guidance as 
to appropriate amino acid substitutions that do not affect biological activity of the 
protein of interest may be found in the model of Dayhoff et al. (1 978) Atlas of 

25 Protein Sequence and 

Structure (Natl. Biomed. Res. Found., Washington, D.C.), herein incorporated by 
reference. Conservative substitutions, such as exchanging one amino acid with 
another having similar properties, may be preferred. 

In constructing variants of the proteins of interest, modifications to the 

30 nucleotide sequences encoding the variants will be made such that variants 
continue to possess the desired activity. Obviously, any mutations made in the 
DNA encoding the variant protein must not place the sequence out of reading 
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frame and preferably will not create complementary regions that could produce 
secondary mRNA structure. See EP Patent Application Publication No. 75,444. 

The isolated proteins of the present invention include a polypeptide 
comprising at least 23 contiguous amino acids encoded by any one of the nucleic 
5 acids of the present invention, or polypeptides which are conservatively modified 
variants thereof. The proteins of the present invention or variants thereof can 
comprise any number of contiguous amino acid residues from a polypeptide of the 
present invention, wherein that number is selected from the group of integers 
consisting of from 23 to the number of residues in a full-length polypeptide of the 
10 present invention. Optionally, this subsequence of contiguous amino acids is at 
least 25, 30, 35, or 40 amino acids in length, often at least 50, 60, 70, 80, or 90 
amino acids in length. 

The present invention includes catalytically active polypeptides (i.e., 
enzymes). Catalytically active polypeptides will generally have a specific activity 
15 of at least 20%, 30%, or 40%, and preferably at least 50%, 60%, or 70%, and 
most preferably at least 80%, 90%, or 95% that of the native (non-synthetic), 
endogenous polypeptide. Further, the substrate specificity (IWKn) is optionally 
substantially similar to the native (non-synthetic), endogenous polypeptide. 
Typically, the K*, will be at least 30%, 40%, or 50%, that of the native (non- 
20 synthetic), endogenous polypeptide; and more preferably at least 60%, 70%, 
80%, or 90%. Methods of assaying and quantifying measures of enzymatic 
activity and substrate specificity (WKn,), are well known to those of skill in the 
art. 

The present invention includes modifications that can be made to an 
25 inventive protein. In particular, it may be desirable to diminish the activity of the 
LEC1 gene. Other modifications may be made to facilitate the cloning, 
expression, or incorporation of the targeting molecule into a fusion protein. Such 
modifications are well known to those of skill in the art and include, for example, a 
methionine added at the amino terminus to provide an initiation site, or additional 
30 amino acids (e.g., poly His) placed on either terminus to create conveniently 
located restriction sites or termination codons or purification sequences. 

Using the nucleic acids of the present invention, one may express a protein 
of the present invention in a recombinantly engineered cell such as bacteria, 
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yeast, insect, mammalian, or preferably plant cells. The cells produce the protein 
in a non-natural condition (e.g., in quantity, composition, location, and/or time), 
because they have been genetically altered through human intervention to do so. 
Typically, an intermediate host cell will be used in the practice of this 
5 invention to increase the copy number of the cloning vector. With an increased 
copy number, the vector containing the gene of interest can be isolated in 
significant quantities for introduction into the desired plant cells. 

Host cells that can be used in the practice of this invention include 
prokaryotes, including bacterial hosts such as Eschericia coli, Salmonella 
1 0 typhimurium, and Serratia marcescens. Eukaryotic hosts such as yeast or 

filamentous fungi may also be used in this invention. Since these hosts are also 
microorganisms, it will be essential to ensure that plant promoters which do not 
cause expression of the polypeptide in bacteria are used in the vector. 

Commonly used prokaryotic control sequences include such commonly 
15 used promoters as the beta lactamase (penicillinase) and lactose (lac) promoter 
systems (Chang etal., Nature 198:1056 (1977)), the tryptophan (trp) promoter 
system (Goeddel et a/., Nucleic Acids Res. 8:4057 (1980)) and the lambda 
derived P L promoter and N-gene ribosome binding site (Shimatake et a/., Nature 
292:128 (1981)). The inclusion of selection markers in DNA vectors transfected 
20 in E coli is also useful. Examples of such markers include genes specifying 
resistance to ampicillin, tetracycline, or chloramphenicol. 

The vector is selected to allow introduction into the appropriate host cell. 
Bacterial vectors are typically of plasmid or phage origin. Expression systems for 
expressing a protein of the present invention are available using Bacillus sp. and 
25 Salmonella (Palva, et a/., Gene 22:229-235 (1983); Mosbach, et a/., Nature 
302:543-545 (1983)). 

Synthesis of heterologous proteins in yeast is well known. See Sherman, 
F., etal., Methods in Yeast Genetics, Cold Spring Harbor Laboratory (1982). Two 
widely utilized yeast for production of eukaryotic proteins are Saccharomyces 
30 cerevisiae and Pichia pastohs. Vectors, strains, and protocols for expression in 
Saccharomyces and Pichia are known in the art and available from commercial 
suppliers (e.g., Invitrogen). Suitable vectors usually have expression control 
sequences, such as promoters, including 3-phosphoglycerate kinase or alcohol 
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oxidase, and an origin of replication, termination sequences and the like as 
desired. 

A protein of the present invention, once expressed, can be isolated from 
yeast by lysing the cells and applying standard protein isolation techniques to the 
lysates. The monitoring of the purification process can be accomplished by using 
Western blot techniques or radioimmunoassay of other standard immunoassay 
techniques. 

The proteins of the present invention can also be constructed using non- 
cellular synthetic methods. Solid phase synthesis of proteins of less than about 
50 amino acids in length may be accomplished by attaching the C-terminal amino 
acid of the sequence to an insoluble support followed by sequential addition of 
the remaining amino acids in the sequence. Techniques for solid phase synthesis 
are described by Barany and Merrifield, Solid-Phase Peptide Synthesis, pp. 3-284 
in The Peptides: Analysis, Synthesis, Biology. Vol. 2: Special Methods in Peptide 
Synthesis, Part A.; Merrifield, etai, J. Am. Chem. Soc. 85:2149-2156 (1963), and 
Stewart et al, Solid Phase Peptide Synthesis, 2nd ed., Pierce Chem. Co., 
Rockford, III. (1984). Proteins of greater length may be synthesized by 
condensation of the amino and carboxy termini of shorter fragments. Methods of 
forming peptide bonds by activation of a carboxy terminal end (e.g., by the use of 
the coupling reagent N.N'-dicycylohexylcarbodiimide)) is known to those of skill. 

The proteins of this invention, recombinant or synthetic, may be purified to 
substantial purity by standard techniques well known in the art, including 
detergent solubilization, selective precipitation with such substances as 
ammonium sulfate, column chromatography, immunopurification methods, and 
others. See, for instance, R. Scopes, Protein Purification: Principles and Practice, 
Springer-Verlag: New York (1982); Deutscher, Guide to Protein Purification, 
Academic Press (1990). For example, antibodies may be raised to the proteins 
as described herein. Purification from £ coli can be achieved following 
procedures described in U.S. Patent No. 4,511,503. Detection of the expressed 
protein is achieved by methods known in the art and include, for example, 
radioimmunoassays, Western blotting techniques or immunoprecipitation. 

The present invention further provides a method for modulating (i.e., 
increasing or decreasing) the concentration or composition of the polypeptides of 
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the present invention in a plant or part thereof. Modulation can be effected by 
increasing or decreasing the concentration and/or the composition (i.e., the ratio 
of the polypeptides of the present invention) in a plant. 

The method comprises transforming a plant cell with an expression 

5 cassette comprising a polynucleotide of the present invention to obtain a 
transformed plant cell, growing the transformed plant cell under conditions 
allowing expression of the polynucleotide in the plant cell in an amount sufficient 
to modulate concentration and/or composition in the plant cell. 

In some embodiments, the content and/or composition of polypeptides of 

10 the present invention in a plant may be modulated by altering, in vivo or in vitro, 
the promoter of a non-isolated gene of the present invention to up- or down- 
regulate gene expression. In some embodiments, the coding regions of native 
genes of the present invention can be altered via substitution, addition, insertion, 
or deletion to decrease activity of the encoded enzyme. See, e.g., Kmiec, U.S. 

1 5 Patent 5,565,350; Zarling et a/., PCT/US93/03868. One method of down- 
regulation of the protein involves using PEST sequences that provide a target for 
degradation of the protein. It has been observed that high levels of LEC1 prevent 
germination. See Lotan etal., Cell 1998 June 26; 93(7):1 195-1205. Thus, 
temporal regulation of LEC1 expression may be desirable in certain species to 

20 permit proper germination, vegetative growth, flowering and reproduction. 

In some embodiments, an isolated nucleic acid (e.g., a vector) comprising 
a promoter sequence is transfected into a plant cell. Subsequently, a plant cell 
comprising the promoter operably linked to a polynucleotide of the present 
invention is selected for by means known to those of skill in the art such as, but 

25 not limited to, Southern blot, DNA sequencing, or PCR analysis using primers 
specific to the promoter and to the gene and detecting amplicons produced 
therefrom. A plant or plant part altered or modified by the foregoing embodiments 
is grown under plant forming conditions for a time sufficient to modulate the 
concentration and/or composition of polypeptides of the present invention in the 

30 plant. Plant forming conditions are well known in the art. 

In general, concentration or composition is increased or decreased by at 
least 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, or 90% relative to a 
native control plant, plant part, or cell lacking the aforementioned expression 
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cassette. Modulation in the present invention may occur during and/or 
subsequent to growth of the plant to the desired stage of development. 
Modulating nucleic acid expression temporally and/or in particular tissues can be 
controlled by employing the appropriate promoter operably linked to a 

5 polynucleotide of the present invention in, for example, sense or antisense 
orientation as discussed in greater detail, supra. Induction of expression of a 
polynucleotide of the present invention can also be controlled by exogenous 
administration of an effective emount of inducing compound. Inducible promoters 
and inducing compounds which activate expression from these promoters are well 

1 0 known in the art. In preferred embodiments, the polypeptides of the present 
invention are modulated in monocots or dicots, preferably maize, soybeans, 
sunflower, sorghum, canola, wheat, alfalfa, rice, barley and millet 

Means of detecting the proteins of the present invention are not critical 
aspects of the present invention. In a preferred embodiment, the proteins are 

1 5 detected and/or quantified using any of a number of well recognized 

immunological binding assays (see, e.g., U.S. Patents 4,366,241; 4,376,110; 
4,517,288; and 4,837,168). For a review of the general immunoassays, see also 
Methods in Cell Biology, Vol. 37: Antibodies in Cell Biology, Asai, Ed., Academic 
Press, Inc. New York (1993); Basic and Clinical Immunology 7th Edition, Stites & 

20 Terr, Eds. (1991). Moreover, the immunoassays of the present invention can be 
performed in any of several configurations, e.g., those reviewed in Enzyme 
Immunoassay, Maggio, Ed., CRC Press, Boca Raton, Florida (1980); Tijan, 
Practice and Theory of Enzyme Immunoassays, Laboratory Techniques in 
Biochemistry and Molecular Biology, Elsevier Science Publishers B.V., 

25 Amsterdam (1 985); Harlow and Lane, supra; Immunoassay: A Practical Guide, 
Chan, Ed., Academic Press, Orlando, FL (1987); Principles and Practice of 
Immunoassays, Price and Newman Eds., Stockton Press, NY (1991); and Non- 
isotopic Immunoassays, Ngo, Ed., Plenum Press, NY (1988). 

Typical methods include Western blot (immunoblot) analysis, analytic 

30 biochemical methods such as electrophoresis, capillary electrophoresis, high 
performance liquid chromatography (HPLC), thin layer chromatography (TLC), 
hyperdiffusion chromatography, and the like, and various immunological methods 
such as fluid or gel precipitin reactions, immunodiffusion (single or double), 
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immunoelectrophoresis, radioimmunoassays (RlAs), enzyme-linked 
immunosorbent assays (ELISAs), immunofluorescent assays, and the like. 

Non-radioactive labels are often attached by indirect means. Generally, a 
ligand molecule (e.g., biotin) is covalently bound to the molecule. The ligand then 

5 binds to an anti-ligand (e.g. , streptavidin) molecule which is either inherently 
detectable or covalently bound to a signal system, such as a detectable enzyme, 
a fluorescent compound, or a chemiluminescent compound. A number of ligands 
and anti-ligands can be used. Where a ligand has a natural anti-ligand, for 
example, biotin, thyroxine, and Cortisol, it can be used in conjunction with the 

10 labeled, naturally occurring anti-ligands. Alternatively, any haptenic or antigenic 
compound can be used in combination with an antibody. 

The molecules can also be conjugated directly to signal generating 
compounds, e.g., by conjugation with an enzyme or fluorophore. Enzymes of 
interest as labels will primarily be hydrolases, particularly phosphatases, 

1 5 esterases and glycosidases, or oxidoreductases, particularly peroxidases. 

Fluorescent compounds include fluorescein and its derivatives, rhodamine and its 
derivatives, dansyl, umbelliferone, etc. Chemiluminescent compounds include 
luciferin, and 2,3-dihydrophthalazinediones, e.g., luminol. For a review of various 
labeling or signal producing systems which may be used, see, U.S. Patent No. 

20 4,391 ,904, which is incorporated herein by reference. 

Some assay formats do not require the use of labeled components. For 
instance, agglutination assays can be used to detect the presence of the target 
antibodies. In this case, antigen-coated particles are agglutinated by samples 
comprising the target antibodies. In this format, none of the components need be 

25 labeled and the presence of the target antibody is detected by simple visual 
inspection. 

The proteins of the present invention can be used for identifying 
compounds that bind to (e.g., substrates), and/or increase or decrease (i.e., 
modulate) the enzymatic activity of, catalytically active polypeptides of the present 
30 invention. The method comprises contacting a polypeptide of the present 

invention with a compound whose ability to bind to or modulate enzyme activity is 
to be determined. The polypeptide employed will have at least 20%, preferably at 
least 30% or 40%, more preferably at least 50% or 60%, and most preferably at 
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least 70% or 80% of the specific activity of the native, full-length polypeptide of 
the present invention (e.g., enzyme). Methods of measuring enzyme kinetics are 
well known in the art. See, e.g., Segel, Biochemical Calculations, 2 nd ed., John 
Wiley and Sons, New York (1976). 

Antibodies can be raised to a protein of the present invention, including 
individual, allelic, strain, or species variants, and fragments thereof, both in their 
naturally occurring (full-length) forms and in recombinant forms. Additionally, 
antibodies are raised to these proteins in either their native configurations or in 
non-native configurations. Anti-idiotypic antibodies can also be generated. Many 
methods of making antibodies are known to persons of skill. 

In some instances, it is desirable to prepare monoclonal antibodies from 
various mammalian hosts, such as mice, rodents, primates, humans, etc. 
Description of techniques for preparing such monoclonal antibodies are found in, 
e.g., Basic and Clinical Immunology, 4th ed., Stites ef a/., Eds., Lange Medical 
Publications, Los Altos, CA, and references cited therein; Harlow and Lane, 
Supra; Goding, Monoclonal Antibodies: Principles and Practice, 2nd ed., 
Academic Press, New York, NY (1986); and Kohler and Milstein, Nature 256:495- 
497 (1975). 

Other suitable techniques involve selection of libraries of recombinant 
antibodies in phage or similar vectors (see, e.g., Huse ef a/., Science 246:1275- 
1281 (1989); and Ward, ef a/., Nature 341 :544-546 (1989); and Vaughan et at., 
Nature Biotechnology, 14:309-314 (1996)). Alternatively, high avidity human 
monoclonal antibodies can be obtained from transgenic mice comprising 
fragments of the prearranged human heavy and light chain Ig loci (i.e., minilocus 
transgenic mice). Fishwild etal., Nature Biotech., 14:845-851 (1996). Also, 
recombinant immunoglobulins may be produced. See, Cabilly, U.S. Patent No. 
4,816,567; and Queen etal., Proc. Natl. Acad. Sci. 86:10029-10033 (1989). 

The antibodies of this invention can be used for affinity chromatography in 
isolating proteins of the present invention, for screening expression libraries for 
particular expression products such as normal or abnormal protein or for raising 
anti-idiotypic antibodies which are useful for detecting or diagnosing various 
pathological conditions related to the presence of the respective antigens. 
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Frequently, the proteins and antibodies of the present invention will be 
labeled by joining, either covalently or non-covalently, a substance which 
provides for a detectable signal. A wide variety of labels and conjugation 
techniques are known and are reported extensively in both the scientific and 
patent literature. Suitable labels include radionucleotides, enzymes, substrates, 
cofactors, inhibitors, fluorescent moieties, chemiluminescent moieties, magnetic 
particles, and the like. 

Transf ection/Transf ormation of Cells 

The method of transformation/transfection is not critical to the instant 
invention; various methods of transformation or transfection are currently 
available. As newer methods are available to transform crops or other host cells 
they may be directly applied. Accordingly, a wide variety of methods have been 
developed to insert a DNA sequence into the genome of a host cell to obtain the 
transcription and/or translation of the sequence to effect phenotypic changes in 
the organism. Thus, any method which provides for efficient 
transformation/transfection may be employed. 

A DNA sequence coding for the desired polynucleotide of the present 
invention, for example a cDNA or a genomic sequence encoding a full length 
protein, can be used to construct an expression cassette which can be introduced 
into the desired plant. Isolated nucleic acid acids of the present invention can be 
introduced into plants according techniques known in the art. Generally, 
expression cassettes as described above and suitable for transformation of plant 

cells are prepared. 

Techniques for transforming a wide variety of higher plant species are well 
known and described in the technical, scientific, and patent literature. See, for 
example, Weising etal., Ann. Rev. Genet. 22:421-477 (1988). For example, the 
DNA construct may be introduced directly into the genomic DNA of the plant cell 
using techniques such as electroporation, PEG poration, particle bombardment, 
silicon fiber delivery, or microinjection of plant cell protoplasts or embryogenic 
callus. See, e.g., Tomes, et a/., Direct DNA Transfer into Intact Plant Cells Via 
Microprojectile Bombardment, pp.197-213 in Plant Cell, Tissue and Organ 
Culture, Fundamental Methods, eds. 0. L. Gamborg and G.C. Phillips. Springer- 
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Verlag Berlin Heidelberg New York, 1995. Alternatively, the DNA constructs may 
be combined with suitable T-DNA flanking regions and introduced into a 
conventional Agmbacterium tumefaciens host vector. The virulence functions of 
the Agmbacterium tumefaciens host will direct the insertion of the construct and 

5 adjacent marker into the plant cell DNA when the cell is infected by the bacteria. 
See, U.S. Patent No. 5,591,616. 

The introduction of DNA constructs using polyethylene glycol precipitation 
is described in Paszkowski et al, Embo J. 3:2717-2722 (1984). Electroporation 
techniques are described in Fromm etal., Pmc. Natl. Acad. Sci. 82:5824 (1985). 

1 0 Ballistic transformation techniques are described in Klein et al., Nature 327:70-73 

(1987) . 

Agmbacterium fume/ac/e/?s-meditated transformation techniques are well 
described in the scientific literature. See, for example Horsch ef al., Science 
233:496-498 (1984), and Fraley etal., Pmc. Natl. Acad. Sci. 80:4803 (1983). For 

1 5 instance, Agmbacterium transformation of maize is described in WO 98/32326. 
Agmbacterium transformation of soybean is described in US Pat. No. 5,563,055. 

Other methods of transfection or transformation include (1 ) Agmbacterium 
rfwzogenes-mediated transformation (see, e.g., Lichtenstein and Fuller In: Genetic 
Engineering, Vol. 6, PWJ Rigby, Ed., London, Academic Press, 1987; and 

20 Lichtenstein, C. P., and Draper, J,. In: DNA Cloning, Vol. II, D. M. Glover, Ed., 
Oxford, IRI Press, 1985), Application PCT/US87/02512 (WO 88/02405 published 
Apr. 7, 1988) describes the use of A rhizogenes strain A4 and its Ri plasmid 
along with A. tumefaciens vectors pARC8 or pARC16 (2) liposome-mediated DNA 
uptake (see, e.g., Freeman etal., Plant Cell Physiol. 25:1353, (1984)), (3) the 

25 vortexing method (see, e.g., Kindle, Proc. Natl. Acad. Sci. USA 87:1228, (1990)). 

DNA can also be introduced into plants by direct DNA transfer into pollen 
as described by Zhou et al., Methods in Enzymology, 101:433 (1983); D. Hess, 
Intern Rev. Cytol., 107:367 (1987); Luo etal., Plane Mol. Biol. Reporter, 6:165 

(1988) . Expression of polypeptide coding polynucleotides can be obtained by 
30 injection of the DNA into reproductive organs of a plant as described by Pena et 

al., Nature, 325:274 (1987). DNA can also be injected directly into the cells of 
immature embryos and the rehydration of desiccated embryos as described by 
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Neuhaus etai, Theor. Appl- Genet, 75:30 (1987); and Benbrook etal., in 
Proceedings Bio Expo 1986, Butterworth, Stoneham, Mass., pp. 27-54 (1986). 

Animal and lower eukaryotic (e.g., yeast) host cells are competent or 
rendered competent for transfection by various means. There are several well- 

5 known methods of introducing DNA into animal cells. These include: calcium 
phosphate precipitation, fusion of the recipient cells with bacterial protoplasts 
containing the DNA, treatment of the recipient cells with liposomes containing the 
DNA, DEAE dextran, electroporation, biolistics, and micro-injection of the DNA 
directly into the cells. The transfected cells are cultured by means well known in 

1 0 the art. Kuchler, R. J., Biochemical Methods in Cell Culture and Virology, Dowden, 
Hutchinson and Ross, Inc. (1977). 

Altering the culture medium to suppress somatic embryogenesis in non- 
transformed plant cells and/or tissues to provide for a positive section 
1 5 means of transformed plant cells 

Using the following methods for controlling somatic embryogenesis, it is 
possible to alter plant tissue culture media components to suppress somatic 
embryogenesis in a plant species of interest (often having multiple components 
that potentially could be adjusted to impart this effect). Such conditions would 

20 not impart a negative or toxic in vitro environment for wild-type tissue, but instead 
would simply not produce a somatic embryogenic growth form. Introducing a 
transgene such as LEC1 will stimulate somatic embryogenesis and growth in the 
transformed cells or tissue, providing a clear differential growth screen useful for 
identifying transformants. 

25 Altering a wide variety of media components can modulate somatic 

embryogenesis (either stimulating or suppressing embryogenesis depending on 
the species and particular media component). Examples of media components 
which, when altered, can stimulate or suppress somatic embryogenesis include; 

1) the basal medium itself (macronutrient, micronutrients and vitamins; see T.A. 
30 Thorpe, 1 981 for review, "Plant Tissue Culture: Methods and Applications in 

Agriculture", Academic Press, NY), 

2) plant phytohormones such as auxins (indole acetic acid, indole butyric acid, 
2,4-dichlorophenoxyacetic acid, naphthaleneacetic acid, picloram, dicamba 
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and other functional analogues), cytokinins (zeatin, kinetin, benzyl amino 
purine, 2-isopentyl adenine and functionally-related compounds) abscisic acid, 
adenine, and gibberellic acid, 

3) and other compounds that exert "growth regulator" effects such as coconut 
water, casein hydrolysate, and proline, and 

4) the type and concentration of gelling agent, pH and sucrose concentration. 
Changes in the individual components listed above (or in some cases 

combinations of components) have been demonstrated in the literature to 
modulate in vitro somatic embryogenesis across a wide range of dicotyledonous 
and monocotyledonous species. For a compilation of examples, see E.F. George 
etal. 1987. Plant Tissue Culture Media. Vol. 1: Formulations and Uses. 
Exergetics, Ltd., Publ., Edington, England. 

Transgenic Plant Regeneration 

Transformed plant cells which are derived by any of the above 
transformation techniques can be cultured to regenerate a whole plant which 
possesses the transformed genotype. Such regeneration techniques often rely or 
manipulation of certain phytohormones in a tissue culture growth medium, 
typically relying on a biocide and/or herbicide marker that has been introduced 
together with a polynucleotide of the present invention. For transformation and 
regeneration of maize see, Gordon-Kamm etal., The Plant Cell, 2:603-618 
(1990). 

Plants cells transformed with a plant expression vector can be 
regenerated, e.g., from single cells, callus tissue or leaf discs according to 
standard plant tissue culture techniques. It is well known in the art that various 
cells, tissues, and organs from almost any plant can be successfully cultured to 
regenerate an entire plant. Plant regeneration from cultured protoplasts is 
described in Evans et al, Protoplasts Isolation and Culture, Handbook of Plant 
Cell Culture, Macmillan Publishing Company, New York, pp. 124-176 (1983); and 
Binding, Regeneration of Plants, Plant Protoplasts, CRC Press, Boca Raton, pp. 
21-73(1985). 

The regeneration of plants containing the foreign gene introduced by 
Agrobacterium can be achieved as described by Horsch et al., Science, 
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227:1229-1231 (1985) and Fraley et at, Proc. Natl. Acad. Sd. U.S.A. 80:4803 
(1983). This procedure typically produces shoots within two to four weeks and 
these transformant shoots are then transferred to an appropriate root-inducing 
medium containing the selective agent and an antibiotic to prevent bacterial 
5 growth. Transgenic plants of the present invention may be fertile or sterile. 

Regeneration can also be obtained from plant callus, explants, organs, or 
parts thereof. Such regeneration techniques are described generally in Klee et 
ai, Ann. Rev. of Plant Phys. 38:467-486 (1987). The regeneration of plants from 
either single plant protoplasts or various explants is well known in the art. See, 

1 0 for example, Methods for Plant Molecular Biology, A. Weissbach and H. 

Weissbach, eds., Academic Press, Inc., San Diego, Calif. (1988). For maize cell 
culture and regeneration see generally, The Maize Handbook, Freeling and 
Walbot, Eds., Springer, New York (1994); Corn and Corn Improvement, 3* 
edition, Sprague and Dudley Eds., American Society of Agronomy, Madison, 

15 Wisconsin (1988). 

One of skill will recognize that after the expression cassette is stably 
incorporated in transgenic plants and confirmed to be operable, it can be 
introduced into other plants by sexual crossing. Any of a number of standard 
breeding techniques can be used, depending upon the species to be crossed. 

20 In vegetatively propagated crops, mature transgenic plants can be 

propagated by the taking of cuttings, via production of apomictic seed, or by 
tissue culture techniques to produce multiple identical plants. Selection of 
desirable transgenics is made and new varieties are obtained and propagated 
vegetatively for commercial use. In seed propagated crops, mature transgenic 

25 plants can be self crossed to produce a homozygous inbred plant. The inbred 
plant produces seed containing the newly introduced heterologous nucleic acid. 
These seeds can be grown to produce plants that would produce the selected 
phenotype. 

Parts obtained from the regenerated plant, such as flowers, seeds, leaves, 
30 branches, fruit, and the like are included in the invention, provided that these 
parts comprise cells comprising the isolated nucleic acid of the present invention. 
Progeny and variants, and mutants of the regenerated plants are also included 
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within the scope of the invention, provided that these parts comprise the 
introduced nucleic acid sequences. 

Transgenic plants expressing a selectable marker can be screened for 
transmission of the nucleic acid of the present invention by, for example, standard 

5 immunoblot and DNA detection techniques. Transgenic lines are also typically 
evaluated on levels of expression of the heterologous nucleic acid. Expression at 
the RNA level can be determined initially to identify and quantitate expression- 
positive plants. Standard techniques for RNA analysis can be employed and 
include PCR amplification assays using oligonucleotide primers designed to 

1 0 amplify only the heterologous RNA templates and solution hybridization assays 
using heterologous nucleic acid-specific probes. The RNA-positive plants can 
then analyzed for protein expression by Western immunoblot analysis using the 
specifically reactive antibodies of the present invention. In addition, in situ 
hybridization and immunocytochemistry according to standard protocols can be 

1 5 done using heterologous nucleic acid specific polynucleotide probes and 

antibodies, respectively, to localize sites of expression within transgenic tissue. 
Generally, a number of transgenic lines are usually screened for the incorporated 
nucleic acid to identify and select plants with the most appropriate expression 
profiles. 

20 A preferred embodiment is a transgenic plant that is homozygous for the 

added heterologous nucleic acid; i.e., a transgenic plant that contains two added 
nucleic acid sequences, one gene at the same locus on each chromosome of a 
chromosome pair. A homozygous transgenic plant can be obtained by sexually 
mating (selfing) a heterozygous transgenic plant that contains a single added 

25 heterologous nucleic acid, germinating some of the seed produced and analyzing 
the resulting plants produced for altered expression of a polynucleotide of the 
present invention relative to a control plant (i.e., native, non-transgenic). Back- 
crossing to a parental plant and out-crossing with a non- transgenic plant are also 
contemplated. Alternatively, propagation of heterozygous transgenic plants could 

30 be accomplished through apomixis. 

The present invention provides a method of genotyping a plant comprising 
a polynucleotide of the present invention. Genotyping provides a means of 
distinguishing homologs of a chromosome pair and can be used to differentiate 
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segregants in a plant population. Molecular marker methods can be used for 
phylogenetic studies, characterizing genetic relationships among crop varieties, 
identifying crosses or somatic hybrids, localizing chromosomal segments affecting 
monogenic traits, map based cloning, and the study of quantitative inheritance. 

5 See, e.g., Plant Molecular Biology: A Laboratory Manual, Chapter 7, Clark, Ed., 
Springer-Verlag. Berlin (1997). For molecular marker methods, see generally, 
The DNA Revolution by Andrew H. Paterson 1996 (Chapter 2) in: Genome 
Mapping in Plants (ed. Andrew H. Paterson) by Academic Press/R. G. Landis 
Company, Austin, Texas, pp.7-21. 

1 o The particular method of genotyping in the present invention may employ 

any number of molecular marker analytic techniques such as, but not limited to, 
restriction fragment length polymorphisms (RFLPs). RFLPs are the product of 
allelic differences between DNA restriction fragments caused by nucleotide 
sequence variability. Thus, the present invention further provides a means to 

1 5 follow segregation of a gene or nucleic acid of the present invention as well as 
chromosomal sequences genetically linked to these genes or nucleic acids using 
such techniques as RFLP analysis. 

Plants which can be used in the method of the invention include 
monocotyledonous and dicotyledonous plants. Preferred plants include maize, 

20 wheat, rice, barley, oats, sorghum, millet, rye, soybean, sunflower, alfalfa, canola 
and cotton. 

Seeds derived from plants regenerated from transformed plant cells, plant 
parts or plant tissues, or progeny derived from the regenerated transformed 
plants, may be used directly as feed or food, or further processing may occur. 

25 All publications cited in this application are herein incorporated by 

reference to the same extent as if each individual publication or patent application 
was specifically and individually indicated to be incorporated by reference. 

The present invention will be further described by reference to the 
following detailed examples. It is understood, however, that there are many 

30 extensions, variations, and modifications on the basic theme of the present 
invention beyond that shown in the examples and description, which are within 
the spirit and scope of the present invention. 
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EXAMPLES 
Example 1 

Library construction used for the maize LEC1 EST's 

5 

A. Total RNA Isolation 

Total RNA was isolated from maize embryo and regenerating callus tissues 
with TRIzol Reagent (Life Technology Inc. Gaithersburg, MD) using a modification 
of the guanidine isothiocyanate/acid-phenol procedure described by Chomczynski 

10 and Sacchi (Chomczynski, P., and Sacchi, N. Anal. Biochem. 162, 156 (1987)). 
In brief, plant tissue samples were pulverized in liquid nitrogen before the addition 
of the TRIzol Reagent, and then were further homogenized with a mortar and 
pestle. Addition of chloroform followed by centrifugation was conducted for 
separation of an aqueous phase and an organic phase. The total RNA was 

1 5 recovered by precipitation with isopropyl alcohol from the aqueous phase. 

B. Poly(A)+ RNA Isolation 

The selection of poly(A)+ RNA from total RNA was performed using 
PolyATact system (Promega Corporation. Madison, Wl). In brief, biotinylated 
20 oligo(dT) primers were used to hybridize to the 3' poly(A) tails on mRNA. The 
hybrids were captured using streptavidin coupled to paramagnetic particles and a 
magnetic separation stand. The mRNA was washed at high stringent condition 
and eluted by RNase-free deionized water. 

25 C. cDNA Library Construction 

cDNA synthesis was performed and unidirectional cDNA libraries were 
constructed using the Superscript Plasmid System (Life Technology Inc. 
Gaithersburg, MD). The first stand of cDNA was synthesized by priming an 
oligo(dT) primer containing a Not I site. The reaction was catalyzed by 

30 Superscript Reverse Transcriptase 1 1 at 45°C. The second strand of cDNA was 
labeled with alpha-^P-dCTP and a portion of the reaction was analyzed by 
agarose gel electrophoresis to determine cDNA sizes. cDNA molecules smaller 
than 500 base pairs and unligated adapters were removed by Sephacryl-S400 
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chromatography. The selected cDNA molecules were ligated into pSPORTI 
vector in between of Not I and Sal I sites. 

Example 2 

5 Sequencing and cDNA subtraction procedures used for maize LEC1 ESTs 

A. Sequencing Template Preparation 

Individual colonies were picked and DNA was prepared either by PCR with 
M1 3 forward primers and M13 reverse primers, or by plasmid isolation. All the 
1 0 cDNA clones were sequenced using M1 3 reverse primers. 

B. Q-bot Subtraction Procedure 

cDNA libraries subjected to the subtraction procedure were plated out on 
22 x 22 cm 2 agar plate at density of about 3,000 colonies per plate. The plates 

1 5 were incubated in a 37°C incubator for 1 2-24 hours. Colonies were picked into 
384-well plates by a robot colony picker, Q-bot (GENETIX Limited). These plates 
were incubated overnight at 37°C. 

Once sufficient colonies were picked, they were pinned onto 22 x 22 cm 
nylon membranes using Q-bot. Each membrane contained 9,216 colonies or 

20 36,864 colonies. These membranes were placed onto agar plate with appropriate 
antibiotic. The plates were incubated at 37°C for overnight. 

After colonies were recovered on the second day, these filters were placed 
on filter paper prewetted with denaturing solution for four minutes, then were 
incubated on top of a boiling water bath for additional four minutes. The filters 

25 were then placed on filter paper prewetted with neutralizing solution for four 
minutes. After excess solution was removed by placing the filters on dry filter 
papers for one minute, the colony side of the filters were place into Proteinase K 
solution, incubated at 37°C for 40-50 minutes. The filters were placed on dry filter 
papers to dry overnight. DNA was then cross-linked to nylon membrane by UV 

30 light treatment. 

Colony hybridization was conducted as described by Sambrook, J., Fritsch, 
E.F. and Maniatis, T, (in Molecular Cloning: A laboratory Manual, 2 nd Edition). 
The following probes were used in colony hybridization: 
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1. First strand cDNA from the same tissue from which the library was made to 
remove the most redundant clones. 

2. 48-1 92 most redundant cDNA clones from the same library based on previous 
5 sequencing data. 

3. 192 most redundant cDNA clones in the entire corn sequence database. 

4. A Sal-A20 oligo nucleotide: TCG ACC CAC GCG TCC GAA AAA AAA AAA 
AAA AAA AAA removes clones containing a poly A tail but no cDNA. 

5. cDNA clones derived from rRNA 

10 

The image of the autoradiography was scanned into computer and the signal 
intensity and cold colony addresses of each colony was analyzed. Re-arraying of 
cold-colonies from 384 well plates to 96 well plates was conducted using Q-bot. 

15 Example 3 

Identification of Maize LEC1 EST's from a Computer Homology Search 

Gene identities were determined by conducting BLAST (Basic Local 
Alignment Search Tool; Altschul, S. F., etal, (1993) J. Mol. Biol. 215:403-410; 

20 see also www.ncbi.nlm.nih.gov/BLAST/) searches under default parameters for 
similarity to sequences contained in the BLAST "nr" database (comprising all non- 
redundant GenBank CDS translations, sequences derived from the 3-dimensional 
structure Brookhaven Protein Data Bank, the last major release of the SWISS- 
PROT protein sequence database, EMBL, and DDBJ databases). The cDNA 

25 sequences were analyzed for similarity to all publicly available DNA sequences 
contained in the "nr" database using the BLASTN algorithm. The DNA sequences 
were translated in all reading frames and compared for similarity to all publicly 
available protein sequences contained in the "nr" database using the BLASTX 
algorithm (Gish, W. and States, D. J. (1993) Nature Genetics 3:266-272) provided 

30 by the NCBI. In some cases, the sequencing data from two or more clones 

containing overlapping segments of DNA were used to construct contiguous DNA 
sequences. 
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EXAMPLE 4 

Composition of cDNA Libraries Used to Isolate and Sequence 
Additional cDNA Clones 

cDNA libraries representing mRNAs from various corn, poppy, soybean 
and Vernonia tissues were prepared (see Table 1). The characteristics of the 
libraries are described below. 

TABLE 1 

cDNA Libraries from Corn, Poppy, Soybean and Vernonia 
Tissue .- 



Corn endosperm 20 days After pollination* 

Prickly poppy developing seeds 

Soybean embryo, 6 to 10 days after flowering 

Soybean embryo, 13 days after flowering 

Soybean embryogenic suspension 2 weeks after 

subculture 

Soybean mature embryo 8 weeks after subculture 

Soybean embryogenic suspension 

vernnnia developing seed* 



These libraries were normalized essentially as described in U.S. Pat. No. 
5,482,845, incorporated herein by reference. 

cDNA libraries were prepared in Uni-ZAP™ XR vectors according to the 
manufacturer's protocol (Stratagene Cloning Systems, La Jolla, CA). Conversion 
of the Uni-ZAP™ XR libraries into plasmid libraries was accomplished according 
to the protocol provided by Stratagene. Upon conversion, cDNA inserts were 
contained in the plasmid vector pBluescript. cDNA inserts from randomly picked 
bacterial colonies containing recombinant pBluescript plasmids were amplified via 
polymerase chain reaction using primers specific for vector sequences flanking 
the inserted cDNA sequences or plasmid DNA was prepared from cultured 
bacterial cells. Amplified insert DNAs or plasmid DNAs were sequenced in dye- 
primer sequencing reactions to generate partial cDNA sequences (expressed 
sequence tags or 'ESTs»; see Adams, M. D. et ai, (1991) Science 252:1651). 
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The resulting ESTs were analyzed using a Perkin Elmer Model 377 fluorescent 
sequencer. 

EXAMPLE 5 

Identification of cDNA Clones obtained from tissue described in Table 1 

ESTs encoding plant transcription factors were identified by conducting 
BLAST (Basic Local Alignment Search Tool; Altschul, S. F., et a/., (1993) J. Mol. 
Biol. 21 5:403-410; see also www.ncbi.nlm.nih.gov/BLAST/) searches for similarity 
to sequences contained in the BLAST "nr" database (comprising all non- 
redundant GenBank CDS translations, sequences derived from the 3-dimensional 
structure Brookhaven Protein Data Bank, the last major release of the SWISS- 
PROT protein sequence database, EMBL, and DDBJ databases). The cDNA 
sequences obtained in Example 1 were analyzed for similarity to all publicly 
available DNA sequences contained in the "nr" database using the BLASTN 
algorithm provided by the National Center for Biotechnology Information (NCBI). 
The DNA sequences were translated in all reading frames and compared for 
similarity to all publicly available protein sequences contained in the "nr" 
database using the BLASTX algorithm (Gish, W. and States, D. J. (1993) Nature 
Genetics 3:266-272 and Altschul, Stephen F., etal. (1997) Nucleic Acids Res. 
25:3389-3402) provided by the NCBI. 

Example 6 

Identification of protein motifs diagnostic for LEC1 genes. 

To determine the structural requirements for a LEC1 gene, HAP3 
homologs were identified in our EST database and aligned. By analyzing 
sequence homology amongst the plant HAP3 family of transcriptional activators 
these sequences were observed to fall into a least two distinctive groups. All of 
the HAP3 sequences derived from seed or embryo specific libraries form a 
distinctive LEC1 group that suggests a common evolutionary origin (confirmed by 
phylodentrograms). For example within the "B domain" of all plant LEC1 
sequences examined, a highly conserved CCAAT-box binding motif has been 
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found to contain the non-variable residues methionine, proline, isoleucine, 
alanine, asparagine, valine, and isoleucine (MPIANVI). LEC1 genes are highly 
divergent outside of the region spanning the DNA binding and subunit interaction 
motifs. The low levels of homology between these genes make it difficult to 
identify these based solely on a hybridization strategy. Using sequences from 
maize, soybean, wheat, prickly poppy, Vernonia, and Arabidopsis a motif 
diagnostic for LEC1 genes was identified, the specific amino acid substitutions for 
these species was clarified (Figure 1) and the positions at which amino acid 
substitutions occur within the LEC1 group was determined (SEQ ID NO: 23). 
Using Blast, the motif in SEQ ID NO: 23 was used to correctly distinguish LECI's 
from other closely related plant HAP3 transcriptional activators. 

Example 7 

Transformation and Regeneration of Maize Callus 

Immature maize embryos from greenhouse or field grown High type II 
donor plants were bombarded with a plasmid containing a polynucleotide of the 
invention (LEC1). The LEC1 polynucleotide was operably linked to a constitutive 
promoter such as nos, or an inducible promoter, such as In2, plus a plasmid 
containing the selectable marker gene PAT (Wohlleben et a/. (1988) Gene 70:25- 
37) that confers resistance to the herbicide Bialaphos fused to the Green 
Fluorescence protein. Transformation was performed as follows. 

The ears were surface sterilized in 50% Chlorox bleach plus 0.5% Micro 
detergent for 20 minutes, and rinsed two times with sterile water. The immature 
embryos were excised and placed embryo axis side down (scutellum side up), 25 
embryos per plate. These were cultured on 560 L medium 4 days prior to 
bombardment in the dark Medium 560 L is an N6-based medium containing 
Eriksson's vitamins, thiamine, sucrose, 2,4-D, and silver nitrate. The day of 
bombardment, the embryos were transferred to 560 Y medium for 4 hours and 
were arranged within the 2.5-cm target zone. Medium 560Y is a high osmoticum 
medium (560L with high sucrose concentration). 

A plasmid vector comprising a polynucleotide of the invention operably 
linked to the selected promoter was constructed. This plasmid DNA plus plasmid 
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DNA containing a PAT selectable marker was precipitated onto 1.1^ (average 
diameter) tungsten pellets using a CaCI 2 precipitation procedure as follows: 100 
Hi prepared tungsten particles (0.6 mg) in water, 20 fil (2 ng) DNA in TrisEDTA 
buffer (1 ug total), 100 2.5 M CaC1 2 , 40 pJ 0.1 M spermidine. 

5 Each reagent was added sequentially to the tungsten particle suspension. 

The final mixture was sonicated briefly. After the precipitation period, the tubes 
were centrifuged briefly, liquid removed, washed with 500 ml 100% ethanol, and 
centrifuged again for 30 seconds. Again the liquid was removed, and 60 ul 100% 
ethanol was added to the final tungsten particle pellet. For particle gun 

1 0 bombardment, the tungsten/DNA particles were briefly sonicated and 5 \i\ spotted 
onto the center of each macrocarrier and allowed to dry about 2 minutes before 
bombardment. 

The sample plates were bombarded at a distance of 8 cm from the 
stopping screen to the tissue, using a Dupont biolistics helium particle gun. All 

15 samples received a single shot at 650 PSI, with a total of ten aliquots taken from 
each tube of prepared particles/DNA 

Four to 12 hours post bombardment, the embryos were moved to 560P (a 
low osmoticum callus initiation medium similar to 560L but with lower silver 
nitrate), for 3-7 days, then transferred to 560R selection medium, an N6 based 

20 medium similar to 560P containing 3 mg/liter Bialaphos, and subcultured every 2 
weeks. Multicellular GFP cell clusters became visible after two weeks and their 
numbers were periodically recorded. After approximately 10 weeks of selection, 
selection-resistant GFP positive callus clones were sampled for PCR and activity 
of the polynucleotide of interest. Positive lines were transferred to 288J medium, 

25 an MS-based medium with lower sucrose and hormone levels, to initiate plant 
regeneration. Following somatic embryo maturation (2-4 weeks), well-developed 
somatic embryos were transferred to medium for germination and transferred to 
the lighted culture room. Approximately 7-10 days later, developing plantlets 
were transferred to medium in tubes for 7-10 days until plantlets were well 

30 established. Plants were then transferred to inserts in flats (equivalent to 2.5" 
pot) containing potting soil and grown for 1 week in a growth chamber, 
subsequently grown an additional 1-2 weeks in the greenhouse, then transferred 
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to Classic™ 600 pots (1.6 gallon) and grown to maturity. Plants are monitored for 
expression of the polynucleotide of interest. 

Example 8 

5 Ectopic Expression of Maize LEC1 to Induce Somatic Embryogenesis 

Using the genotype High type II as an example, embryos were isolated and 
cultured on 560L medium for 3-5 days. Four to twelve hours before bombardment 
these embryos were transferred to high osmotic 560Y medium. Expression 

1 0 cassettes containing the LEC 1 cDNA were then co-introduced into the scutella of 
these embryos along with an expression cassette containing the Pat gene fused 
to the Green Fluorescent protein using methods described in Example 7. 
Embryos from a single ear were divided evenly between treatments. Four to 12 
hours following bombardment embryos were then transferred back to a low 

1 5 osmoticum callus initiation medium (560P) and incubated in the dark at 26°C. 
After 3-7 days of culture these embryos were moved to 560R selection medium. 
Cultures were then transferred every two weeks until transformed colonies 
appear. Cultures were also examined microscopically for GFP expression. LEC1 
expression was expected to stimulate adventive embryo formation. This was 

20 apparent when the cultures were compared to controls (transformed without the 
LEC1 cDNA or non-induced). 

A. Ectopic expression of the maize LEC1 polynucleotide in tobacco is 
sufficient to induce somatic embryogenesis in tobacco leaves 

25 A maize LEC1 polynucleotide was placed into an agrobacterium 

expression cassette driven by the maize safener-induced In2 promoter (this 
promoter is leaky and expresses at low levels without induction). Also between 
the left and right T-DNA borders was the bar gene driven by 35S promoter and 
the Green Fluorescence Protein driven by the ubiquitin promoter. A similar 

30 construct was made without the LEC1 polynucleotide to be used as a control. 
Tobacco leaf discs from variety SR1 were co-cultured with Agrobacterium as 
described by Horsch etal. (1985, Science 227:1229-1231) except selecting with 
bialaphos rather than kanamycin. Transformants were selected on medium 
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containing 3mg/l bialaphos. Of the numerous transformed shoot obtained, ectopic 
embryos were visible on the leaves of a single LEC1 transformant. None were 
visible on control plants. Although the frequency was low, ectopic somatic 
embryo formation was also reported to be a rare event in Arabidopsis LEC1 
5 transformants (Lotan et a/. 1 998). 

B. Transformation frequency was improved by LEC1 introduced using 
particle-mediated DNA delivery. 

A series of expression cassettes were made to evaluate the effects of 

10 LEC1 expression on maize transformation. The maize LEC1 polynucleotide was 
placed under the control of the In2 promoter (weakly induced with the auxin levels 
used under normal culture conditions and strongly-induced with safener), the 
barley NUC1 promoter (expressed strongly in the nucellus), the Ubiquitin 
promoter (strongly expressed constitutively), and the nos promoter (weakly 

15 expressed constitutively). A frame-shift version of the ln2:LEC1 cassette was 
made along with an ln2:ZM-NF-YB (designated as ln2:HAP3 henceforth) 
construct (The maize ZM NF-YB is non-LEC1 type of HAP3 transcriptional 
activator (Li et al Nucleic Acids Res. 20:1087-1091) for use as negative controls. 
All of these constructs were co-bombarded with the Pat~GFP fusion construct 

20 (designated as PAT-GFP) into high type II embryos as described in Example 7. 
Also, as in Example 7, immature embryos were harvested from separate ears, and 
the embryos from each ear were divided equally between treatments to account 
for ear-to-ear variability (for example, in an experiment comparing a control 
plasmid with this same plasmid + LEC1, one-half the total embryos from each ear 

25 would be used for each treatment. In some cases the control treatment contained 
the Pat~GFP construct co-bombarded with GUS. Transformation frequency was 
determined by counting the numbers of embryos with large multicellular GFP- 
positive cells clusters using a GFP microscope, and representing these as a 
percentage of the original number of embryos bombarded. No distinction was 

30 made between embryos with single or multiple events. In all cases, the functional 
LEC1 expression cassettes increased transformation frequencies over the control 
treatment (the LEC1 expression cassette also increased the incidence of multiple, 
i.e. 2-3, multicellular transgenic clones growing from the same immature embryo, 
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but as stated above we only scored these as a single event, and are providing a 
conservative representation of LEd's ability to improve transformation). For 
example, transformation frequencies in control treatments for three consecutive 
experiments were 5.1 , 7.4 and 0.8%. In balanced side-by-side comparisons for 
5 the same three experiments, transformation frequencies with the LEC1 

polynucleotide (ln2::LEC1::pinll) were 28.8, 25.7 and 12.4%, respectively. In 
addition to increasing the absolute number of transformants recovered from a 
given amount of target tissue, LEC1 transformants appeared earlier than the 
control transformants (suggesting that the LEC1 polynucleotide also stimulated 

10 growth rates). 

As a more stringent control, an expression vector was constructed in which 
a LEC1 gene, frame-shifted immediately after the start codon, was placed behind 
the In2 promoter. In this experiment using embryos from 3 separate ears, the 
transformation frequencies in the control (frame-shifted LEC1) treatments were 

1 5 2.7, 6.0 and 2.0%, while the transformation frequencies for the LEC1 treatments 
were 62.7, 26 and 42.7%. This demonstrated clearly that expression of the in- 
frame LEC1 polynucleotide was associated with dramatic increases in 
transformation efficiency. 

Increasing the promoter strength (driving LEC1 expression) increased 

20 transformation frequencies. For example, an experiment was performed to 
compare the In2, nos and UBI promoters. Based on our experience with these 
two promoters driving other genes, the In2 promoter (in the absence of an inducer 
other than auxin from the medium) would drive expression at very low levels. The 
nos promoter has been shown to drive moderately-low levels of transgene 

25 expression (approximately 1 0- to 30-fold lower than the maize ubiquitin promoter, 
but still stronger than In2 under the culture conditions used in this experiment). 
Two control treatments were used in this experiment; the frame-shifted LEC1 
driven by the In2 promoter, or a maize ln2:HAP3 polynucleotide (a "non-LEC1 
type" representative of the transcriptional factor family to which LEC1 belongs). 

30 Both control treatments resulted in low transformation frequencies. After 3 weeks, 
the transformation frequency for the ln2:frame-shift-LEC1 (FS) treatment was 
4.8%, while for the ln2:HAP3 treatment it was 2%. The ln2:LEC1, nos:LEC1 and 
UBLLEC1 treatments resulted in 14%, 28% and 30% transformation frequencies, 
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respectively. Within these treatments there was also an increase in the overall 
frequency of large, rapidly growing calli. For the control treatments, the 
frequencies of large, vigorous GFP+ calli (relative to the starting number of 
embryos) was low (1 .6 and 0% for the frame-shift or ln2:HAP3, respectively). For 

5 the In2, nos and UBI treatments the overall frequencies of large, vigorous calli 
was 4, 13.3 and 20%, respectively. This is consistent with the interpretation that 
increased LEC1 expression resulted in more rapid in vitro growth of transgenic 
tissue. As is typical for transformation experiments scored in this fashion, 
between 3-6 weeks the number of recovered transformants continues to rise. 

1 0 After 5 weeks (post-bombardment), the frequency of healthy, growing 

transformants was 4.8 and 7.3% for the FS and HAP3 controls, while for the In2, 
nos and UBI-driven LEC1 treatments the frequencies were 22, 29.3 and 35.3%. 



C. Transformation frequency was improved by LEC1 introduced using 
15 agrobacterium. 

The Agrobacterium strains containing the superbinary plasmids described 
in Example 8A were used to transformed High type II embryos. Briefly, colonies 
containing the engineered Agrobacterium were grown to log phase in minimal A 
medium. Log phase cells were collected by centrifugation and resuspended in 

20 561 Q medium (N6 salts, Eriksson's vitamins, 1 .5mg/l 2,4-D , 68.5g/l sucrose, 
36g/i glucose, plus 20mg/l acetosyringone). Immature embryos, 1.5-2mm in 
length, were excised and immersed in this solution at a concentration of 5 X 10 8 
bacterial cells/ml. Embryos were vortexed in this medium and allowed to sit for 5 
minutes. The embryos were then removed and placed on 562P medium (560P 

25 medium with 1 0OmM acetosyringone and incubated at 20°C for 3 days. Embryos 
were moved again to 563N medium (an agar solidified medium similar to 560P 
with 100mg/l carbenicillin, 0.5g/l MES and reduced 2,4-D) and cultured at 28°C for 
3 days. Embryos were then moved to 5630 medium (563N medium with 3mg/l 
bialaphos) and transferred thereafter every 14 days to fresh 5630 medium. 

30 Bialaphos resistant GFP+ colonies were counted using a GFP microscope 

and transformation frequencies were determined as described in example 8B. 
Similar to particle gun experiments, transformation frequencies were greatly 
increased in the LEC1 treatment. For example, transformation frequencies for the 
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control treatment across embryos taken from 7 separate ears were 7.1 , 40.9, 
11.1, 7 4, 11.5, 12, 30.8, and 16.6%. The side-by-side comparison for the LEC 
treatment (in the same order of ears as above) showed that transformation 
frequencies were 13.5, 47, 55.8, 37.1, 40.6, 30, 57.1 and 40.8%. Averaged 

5 across all 7 ears, the average transformation frequency for the control was 1 6.6% 
while that of the LEC1 treatment was 40.8%. This represents a substantial 
increase for an already high baseline produced by Agrobacterium-mediated 
transformation. Comparing across ears, it was observed that the beneficial 
effects on transformation frequency were the greatest when the control 

1 0 frequencies were low. 

D. Transformants were recovered using LEC1 expression under reduced 
auxin levels or in the absence of auxins in the medium, and in the absence 
of herbicide or antibiotic selection. 

15 To determine if LEC1 could be used in a positive selection scheme, 

particle gun transformation experiments were initiated as described in Example 4 
and transformants were selected on medium with normal auxin levels, or on 
medium with reduced or no auxin, or visually (using GFP) on medium without 
bialaphos. Transformation frequencies were based on the numbers of embryos 

20 with one or more multicellular GFP positive cell clusters. In the first experiment to 
test this concept, there were two treatment variables. The first was that immature 
embryos were bombarded with the control plasmid (UBI:PAT~GFP) or with 
UBI:PAT~GFP+ln2:LEC1. The second variable was that the bombarded embryos 
were divided onto either normal bialaphos-containing selection medium (with 

25 normal auxin levels of 2 mg/l 2,4-D), or medium with no bialaphos and reduced 
2,4-D levels (0.5 mg/l). As expected from previous results, on bialaphos selection 
the LEC1 treatment resulted in a higher transformation frequency than the control 
(5.7 versus 2.5%). It was also anticipated that the low auxin medium (0.5 mg/l 
2,4-D) would result in reduced growth rates. Consistent with this, for the control 

30 plasmid treatment (UBI:PAT-GFP), recovery of GFP-expressing (fluorescent) 
colonies was reduced relative to highly-effective bialaphos-selection treatment, 
dropping down to 0.6%. In contrast, it appeared that LEC1 expression, through 
its stimulation of embryogenesis, may have compensated for the low auxin 
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environment, providing a growth advantage to the transgenic colonies, and 
maintaining the efficiency of transformant recovery at 4.0% (still in the same 
range as the LEC1/bialaphos-selected treatment). It's clear from this result that 
the inclusion of LEC1 improved colony growth on reduced auxin relative to the 
control. 

On medium completely devoid of auxin, colonies were only observed in the 
LEC1 treatment. In this experiment, immature embryos were bombarded with 
either the control plasmid (UBI:PAT~GFP) or with UBI:PAT~GFP+ln2:LEC1, and 
then plated either onto 3.0 mg/l bialaphos, 2.0 mg/1 2,4-D medium or onto no- 
bialaphos, no 2,4-D medium (in this latter treatment, wild-type maize callus will 
not exhibit embryogenic growth). Again, as expected, the LEC1 polynucleotide 
increased transformation to 22.7% over the control plasmid value of 8% on 
normal auxin-containing, bialaphos selection medium. Also, as expected, no 
transformants were recovered with the control plasmid on medium devoid of 
exogenous auxin. Surprisingly, in the LEC1 treated embryos transformants were 
recovered at a 4% frequency (this was still higher than the control plasmid on 

bialaphos selection). 

Even on auxin-containing medium, the LEC1 polynucleotide in combination 
with GFP+ expression can be used to recover transformants without chemical 
selection. For example, under these conditions the recovery of transformants was 
relatively efficient (16% compared to 18% for bialaphos selection), but this 
required more diligence than the low- or no-auxin treatments above to separate 
the GFP-expressing colonies from the growing callus population. 

E. LEC1 improves the embryogenic phenotype and regeneration capacity of 
inbreds. 

Immature embryos from the inbred PHP38 were isolated, cultured and 
transformed as described in example 4 with the following changes. Embryos were 
initially cultured on 601 H medium (a MS based medium with 0.1 mg/l zeatin, 2mg/l 
2,4-D, MS and SH vitamins, proline, silver nitrate, extra potassium nitrate, casein 
hydrolysate, gelrite, 10g/l glucose and 20g/l sucrose). Prior to bombardment 
embryos were moved to a high osmoticum medium (modified Duncan's with 2mg/l 
2,4-D and 12% sucrose). Post bombardment, embryos were moved to 601 H 
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medium with 3mg/l bialaphos for two weeks. Embryos were then moved to 601 H 
medium without proline and casein hydrolysate with 3mg/l bialaphos and 
transferred every two weeks. Transformation frequency was determined by 
counting the numbers of bialaphos resistant GFP-positive colonies. Colonies 
were also scored on whether they had an embryogenic (regenerate) or non- 
embryogenic phenotype. In PHP38, the LEC1 polynucleotide increased 
transformation frequency and improved the regenerative potential of the callus. 
For example, a balanced experiment (the embryos from each harvested ear were 
divided equally between treatments) was conducted in which PHP38 immature 
embryos were bombarded with the control plasmid (UBI::PAT~GFP::pinll) in one 
treatment, with the UBI::PAT-GFP::pinll plasmid + ln2::LEC1, or with the 
UBI::PAT~GFP::pinll plasmid + nuc1::LEC1 (a maize nucellus-specific promoter 
driving LEC1 expression). The frequency of GFP+ calli growing on bialaphos- 
containing media (relative to the starting number of embryos) was determined 6 
weeks after bombardment. For the control treatment, the transformation 
frequency was 1.2%, while for the ln2:LEC1 and nuc1::LEC1 treatments the 
transformation frequencies were 3.2 and 2.0% respectively. In addition, the 
presence of the LEC1 polynucleotide appeared to greatly improve the 
regeneration capacity of the recovered transformants. None of the control 
transformants (UB!::PAT~GFP::pinll alone) had an embryogenic, regenerable 
phenotype, while the transformants recovered from the ln2:LEC1 and nud::LEC1 
treatments all exhibited a more vigorous, embryogenic growth pattern. This has 
been born out in the ability to recover plants. Callus from the ln2:LEC1 and 
nuc1::LEC1 treatments has produced many healthy plants. 



Example 9 

Transient Expression of the LEC1 polynucleotide Product 
to Induce Somatic Embryogenesis 

It may be desirable to "kick start" somatic embryogenesis by transiently 
expressing the LEC-1 polynucleotide product. This can be done by delivering 
LEC1 5'capped polyadenylated RNA, expression cassettes containing LEC-1 
DNA, or LEC-1 protein. All of these molecules can be delivered using a biolistics 
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particle gun. For example 5'capped polyadenylated LEC1 RNA can easily be 
made in vitro using Amnion's mMessage mMachine kit Following the procedure 
outline above RNA is co-delivered along with DNA containing an agronomically 
useful expression cassette. The cells receiving the RNA will immediately form 
somatic embryos and a large portion of these will have integrated the agronomic 
gene. Plants regenerated from these embryos can then be screened for the 
presence of the agronomic gene. 

Example 10 
Use of the maize LEC1 to induce apomixis 

Maize expression cassettes directing LEC1 expression to the inner 
integument or nucellus can easily be constructed. An expression cassette 
directing expression of the LEC1 polynucleotide to the nucellus was made using 
the barley Nuc1 promoter. Embryos were co-bombarded with the selectable 
marker PAT fused to the GFP gene along with the nucellus specific LEC1 
expression cassette described above. Both inbred (PHP38) and GS3 
transformants were obtained and regenerated as described in examples 4 and 5. 
Transformation frequencies were also increased over the control using the 
nuc1:LEC1 polynucleotide (see Example 8 above). 

It is anticipated that the regenerated plants will then be capable of 
producing de novo embryos from LEC1 expressing nucellar cells. This is 
complemented by pollinating the ears to promote normal central cell fertilization 
and endosperm development. In another variation of this scheme, nud :LEC1 
transformations could be done using a FIE-null genetic background which would 
promote both de novo embryo development and endosperm development without 
fertilization (see Ohad et al. 1 999 The Plant Cell 1 1 :407-41 5; also pending US 
application Serial No. 60/151575 filed August 31, 1999). Upon microscopic 
examination of the developing embryos it will be apparent that apomixis has 
occurred by the presence of embryos budding off the nucellus. In yet another 
variation of this scheme the LEC1 polynucleotide could be delivered as described 
above into a homozygous zygotic-embryo-lethal genotype. Only the adventive 
embryos produced from somatic nucellus tissue would develop in the seed. 
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EXAMPLE 11 
Expression of Chimeric Genes in Microbial Cells 

The cDNAs encoding the instant transcription factors can be inserted into 
the T7 £. co// expression vector pBT430. This vector is a derivative of pET-3a 
(Rosenberg et al. (1987) Gene 56:125-135) which employs the bacteriophage T7 
RNA polymerase/T7 promoter system. Plasmid pBT430 was constructed by first 
destroying the EcoR I and Hind III sites in pET-3a at their original positions. An 
oligonucleotide adaptor containing EcoR I and Hind III sites was inserted at the 
BamH I site of pET-3a. This created pET-3aM with additional unique cloning sites 
for insertion of genes into the expression vector. Then, the Nde I site at the 
position of translation initiation was converted to an Nco I site using 
oligonucleotide-directed mutagenesis. The DNA sequence of pET-3aM in this 
region, 5'-CATATGG, was converted to 5-CCCATGG in pBT430. 

Plasmid DNA containing a cDNA may be appropriately digested to release a 
nucleic acid fragment encoding the protein. This fragment may then be purified 
on a 1% NuSieve GTG™ low melting agarose gel (FMC). Buffer and agarose 
contain 10 ug/ml ethidium bromide for visualization of the DNA fragment. The 
fragment can then be purified from the agarose gel by digestion with GELase™ 
(Epicentre Technologies) according to the manufacturer's instructions, ethanol 
precipitated, dried and resuspended in 20 pL of water. Appropriate 
oligonucleotide adapters may be ligated to the fragment using T4 DNA ligase 
(New England Biolabs, Beverly, MA). The fragment containing the ligated 
adapters can be purified from the excess adapters using low melting agarose as 
described above. The vector pBT430 is digested, dephosphorylated with alkaline 
phosphatase (NEB) and deproteinized with phenol/chloroform as described 
above. The prepared vector pBT430 and fragment can then be ligated at 16°C 
for 15 hours followed by transformation into DH5 electrocompetent cells (GIBCO 
BRL). Transformants can be selected on agar plates containing LB media and 
100 jxg/mL ampicillin. Transformants containing the polynucleotide encoding the 
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transcription factor are then screened for the correct orientation with respect to 
the T7 promoter by restriction enzyme analysis. 

For high level expression, a plasmid clone with the cDNA insert in the 
correct orientation relative to the T7 promoter can be transformed into £ coli 

5 strain BL21(DE3) (Studier et a/. (1986) J. Mol. Biol. 789:113-130). Cultures are 
grown in LB medium containing ampicillin (100 mg/L) at 25°C. At an optical 
density at 600 nm of approximately 1, IPTG (isopropylthio-p-galactoside, the 
inducer) can be added to a final concentration of 0.4 mM and incubation can be 
continued for 3 hours at 25°C. Cells are then harvested by centrifugation and re- 

1 0 suspended in 50 pL of 50 mM Tris-HCI at pH 8.0 containing 0. 1 mM DTT and 0.2 
mM phenyl methylsulfonyl fluoride. A small amount of 1 mm glass beads can be 
added and the mixture sonicated 3 times for about 5 seconds each time with a 
microprobe sonicator. The mixture is centrifuged and the protein concentration of 
the supernatant determined. One ug of protein from the soluble fraction of the 

1 5 culture can be separated by SDS-polyacrylamide gel electrophoresis. Gels can 
be observed for protein bands migrating at the expected molecular weight. 

EXAMPLE 12 

Evaluating Compounds for Their Ability to Inhibit the Activity 
20 of Plant Transcription Factors 

The transcription factors described herein may be produced using any 
number of methods known to those skilled in the art. Such methods include, but 
are not limited to, expression in bacteria as described in Example 6, or expression 

25 in eukaryotic cell culture, in planta, and using viral expression systems in suitably 
infected organisms or cell lines. The instant transcription factors may be 
expressed either as mature forms of the proteins as observed in vivo or as fusion 
proteins by covalent attachment to a variety of enzymes, proteins or affinity tags. 
Common fusion protein partners include glutathione S-transferase ("GST"), 

30 thioredoxin ("Trx"), maltose binding protein, and C- and/or N-terminal 

hexahistidine polypeptide ( u (His) 6 "). The fusion proteins may be engineered with 
a protease recognition site at the fusion point so that fusion partners can be 
separated by protease digestion to yield intact mature enzyme. Examples of such 
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proteases include thrombin, enterokinase and factor Xa. However, any protease 
can be used which specifically cleaves the peptide connecting the fusion protein 
and the enzyme. 

Purification of the instant transcription factors, if desired, may utilize any 

5 number of separation technologies familiar to those skilled in the art of protein 
purification. Examples of such methods include, but are not limited to, 
homogenization, filtration, centrifugation, heat denaturation, ammonium sulfate 
precipitation, desalting, pH precipitation, ion exchange chromatography, 
hydrophobic interaction chromatography and affinity chromatography, wherein the 

1 0 affinity ligand represents a substrate, substrate analog or inhibitor. When the 
transcription factors are expressed as fusion proteins, the purification protocol 
may include the use of an affinity resin which is specific for the fusion protein tag 
attached to the expressed enzyme or an affinity resin containing ligands which 
are specific for the enzyme. For example, a transcription factor may be 

1 5 expressed as a fusion protein coupled to the C-terminus of thioredoxin. In 
addition, a (His) 6 peptide may be engineered into the N-terminus of the fused 
thioredoxin moiety to afford additional opportunities for affinity purification. Other 
suitable affinity resins could be synthesized by linking the appropriate ligands to 
any suitable resin such as Sepharose-4B. In an alternate embodiment, a 

20 thioredoxin fusion protein may be eluted using dithiothreitol; however, elution may 
be accomplished using other reagents which interact to displace the thioredoxin 
from the resin. These reagents include p-mercaptoethanol or other reduced thiol. 
The eluted fusion protein may be subjected to further purification by traditional 
means as stated above, if desired. Proteolytic cleavage of the thioredoxin fusion 

25 protein and the enzyme may be accomplished after the fusion protein is purified 
or while the protein is still bound to the ThioBond™ affinity resin or other resin. 

Crude, partially purified or purified enzyme, either alone or as a fusion 
protein, may be utilized in assays for the evaluation of compounds for their ability 
to inhibit enzymatic activition of the transcription factors disclosed herein. Assays 

30 may be conducted under well-known experimental conditions that permit optimal 
enzymatic activity. 
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Example 13 

LEC1 expression resulted in increased growth rates, which could be used 
as a screening criterion for positive selection of transformants. 

5 Using two promoters of increasing strength to drive LEC1 expression in maize, 
it appeared that LEC1 stimulated callus growth over control treatments and the 
stronger promoter driving LEC1 resulted in faster growth than with the low-level 
promoter. For example, an experiment was performed to compare the In2 and 
nos promoters. As noted above, based on our experience with these two 

1 0 promoters driving other genes, the In2 promoter (in the absence of an inducer 
other than auxin from the medium) would drive expression at very low levels. The 
nos promoter has been shown to drive moderately-low levels of transgene 
expression (approximately 10- to 30-fold lower than the maize ubiquitin promoter, 
but still stronger than In2 under the culture conditions used in this experiment). 

1 5 One control treatment was used in this experiment, the UBI:PAT~GFPmo:pinll 
construct by itself (with no LEC1). Hi-ll immature embryos were bombarded as 
previously described, and transgenic, growing events were scored at 3 and 6 
weeks. The control treatment resulted in a transformation frequency of 0.8%. 
The ln2:LEC1 and nos:LEC1 treatments resulted in transformation frequencies of 

20 26.5 and 40.7%, respectively. 

Within these treatments there was also an increase in the overall frequency of 
large, rapidly growing calli, relative to the control treatment. For this data, the 
fresh weight of transformed calli were recorded 2 months after bombardment. 
Assuming that all the transgenic events started as single transformed cells within 

25 a few days after bombardment, these weights represent the relative growth rate of 
these transformants during this period (all tissue was sub-cultured and weighed 
for each transformant; mean weights and standard deviations were calculated for 
each treatment). For the control treatment, the mean transformant weight after 
two months was 37 +/- 15 mg (n=6). For the ln2:LEC1 and nos:LEC1 treatments, 

30 the mean transformant weights were 126 +/- 106 and 441 +/- 430 mg, 

respectively. If the control treatment was set at a relative growth value of 1 .0, this 
means that transformants in the ln2:LEC1 and nos:LEC1 treatments grew 3.4 and 
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12-fold faster than the control. From this data, it appears that increasing LEC1 
expression resulted in a concomitant increase in callus growth rate. 

Example 14 

5 The use of LEC1 polynucleotide as a positive selection system for wheat 
transformation and for improving the regeneration capacity of wheat tissues 

Method 
Plant Material 

1 0 Seeds of wheat Hybrinova lines NH535 and BO 01 4 were sown into soil in 

plug trays for vernalisation at 6°C for eight weeks. Vernalized seedlings were 
transferred in 8" pots and grown in a controlled environment room. The growth 
conditions used were; 1) soil composition: 75% L&P fine-grade peat, 12% 
screened sterilized loam, 10% 6 mm screened, lime-free grit, 3% medium grade 

1 5 vermiculite, 3.5 kg Osmocote per m 3 soil (slow-release fertiliser, 15-11-13 NPK 
plus micronutrients), 0.5 kg PG mix per m 3 (14-16-18 NPK granular fertiliser plus 
micronutrients, 2) 16 h photoperiod (400W sodium lamps providing irradiance of 
ca. 750 uE s" 1 m' 2 ), 18 to 20 8 C day and 14 to 16°C night temperature, 50 to 70% 
relative air humidity and 3) pest control: sulphur spray every 4 to 6 weeks and 

20 biological control of thrips using Amblyseius caliginosus (Novartis BCM Ltd, UK). 

Isolation of Explants and Culture Initiation 

Two sources of primary explants were used; scutellar and inflorescence 
tissues. For scutella, early-medium milk stage grains containing immature 

25 translucent embryos were harvested and surface-sterilized in 70% ethanol for 5 
min and 0.5% hypochlorite solution for 15-30 min. For inflorescences, tillers 
containing 0.5-1.0 cm inflorescences were harvested by cutting below the 
inflorescence-bearing node (the second node of a tiller). The tillers were trimmed 
to approximately 8-10 cm length and surface-sterilized as above with the upper 

30 end sealed with Nescofilm (Bando Chemical Ind. Ltd, Japan). 

Under aseptic conditions, embryos of approximately 0.5-1.0 mm length 
were isolated and the embryo axis removed. Inflorescences were dissected from 
the tillers and cut into approximately 1 mm pieces. Thirty scutella or 1 mm 
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inflorescence explants were placed in the center (18 mm target circle) of a 90 mm 
Petri dish containing MD0.5 or L7D2 culture medium. Embryos were placed with 
the embryo-axis side in contact with the medium exposing the scutellum to 
bombardment whereas inflorescence pieces were placed randomly. Cultures 
5 were incubated at 25±°C in darkness for approximately 24 h before bombardment. 
After bombardment, explants from each bombarded plate were spread across 
three plates for callus induction. 

Culture Media 

1 0 The standard callus induction medium for scutellar tissues (MD0.5) 

consisted of solidified (0.5% Agargel, Sigma A3301 ) modified MS medium 
supplemented with 9% sucrose, 10 mg I" 1 AgNOs and 0.5 mg I* 1 2,4-D (Rasco- 
Gaunt ef a/., 1999). Inflorescence tissues were cultured on L7D2 which consisted 
of solidified (0.5% Agargel) L3 medium supplemented with 9% maltose and 2 mg 

15 r 1 2,4-D (Rasco-Gaunt and Barcelo, 1999). The basal shoot induction medium, 
RZ contained L salts, vitamins and inositol, 3% w/v maltose, 0.1 mg P 2,4-D and 
5 mg P zeatin (Rasco-Gaunt and Barcelo, 1999). Regenerated plantlets were 
maintained in RO medium with the same composition as RZ, but without 2,4-D 
and zeatin. 

20 

DNA Precipitation Procedure and Particle Bombardment 

Submicron gold particles (0.6 urn Micron Gold, Bio-Rad) were coated with 
a plasmid containing the maize ln-2:LEC1 construct following the protocol 
modified from the original Bio-Rad procedure (Barcelo and Lazzeri, 1995). The 

25 standard precipitation mixture consisted of 1 mg of gold particles in 50 pi SDW, 
50 pi of 2.5 M calcium chloride, 20 pi of 100 mM spermidine free base and 5 pi 
DNA (concentration 1 pg pi" 1 ). After combining the components, the mixture was 
vortexed and the supernatant discarded. The particles were then washed with 
150 pi absolute ethanol and finally resuspended in 85 pi absolute ethanol. The 

30 DNA/gold ethanol solution was kept on ice to minimise ethanol evaporation. For 
each bombardment, 5 pi of DNA/gold ethanol solution (ca. 60 pg gold) was 
loaded onto the macrocarrier. 
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particle bombardments were carried out using DuPont PDS 1000/He gun 
with a target distance of 5.5 cm from the stopping plate at 650 psi acceleration 
pressure and 28 in. Hg chamber vacuum pressure. 

5 Regeneration of Transformants 

For callus induction, bombarded explants were distributed over the surface 
of the medium in the original dish and two other dishes and cultured at 25+1°C in 
darkness for three weeks. Development of somatic embryos from each callus 
were periodically recorded. For shoot induction, calluses were transferred to RZ 

1 o medium and cultured under 1 2 h light (250 \iE s 1 nY 2 , from cool white fluorescent 
tubes) at 25+1 °C for three weeks for two rounds. All plants regenerating from the 
same callus were noted. Plants growing more vigorously than the control cultures 
were potted in soil after 6-9 weeks in R0 medium. The plantlets were 
acclimatized in a propagator for 1-2 weeks. Thereafter, the plants were grown to 

1 5 maturity under growth conditions described above. 

DNA Isolation from Callus and Leaf Tissues 

Genomic DNA was extracted from calluses or leaves using a modification 
of the CTAB (cetyltriethylammonium bromide, Sigma H5882) method described by 

20 Stacey and Isaac (1 994). Approximately 1 00-200 mg of frozen tissues was 

ground into powder in liquid nitrogen and homogenised in 1 ml of CTAB extraction 
buffer (2% CTAB, 0.02 M EDTA 0.1 M Tris-CI pH 8, 1.4 M NaCI, 25 mM DTT) for 
30 min at 65°C. Homogenised samples were allowed to cool at room temperature 
for 15 min before a single protein extraction with approximately 1 ml 24:1 v/v 

25 chlorofomroctanol was done. Samples were centrifuged for 7 min at 1 3,000 rpm 
and the upper layer of supernatant collected using wide-mouthed pipette tips. 
DNA was precipitated from the supernatant by incubation in 95% ethanol on ice 
for 1 h. DNA threads were spooled onto a glass hook, washed in 75% ethanol 
containing 0.2 M sodium acetate for 10 min, air-dried for 5 min and resuspended 

30 in TE buffer. Five pi RNAse A was added to the samples and incubated at 37°C 
for 1 h. 
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For quantification of genomic DNA, gel electrophoresis was performed 
using an 0.8% agarose gel in 1xTBE buffer. One microlitre of the samples were 
fractionated alongside 200, 400, 600 and 800 ng pi* 1 ^ uncut DNA markers. 

5 Polymerase chain reaction (PCR) analysis 

The presence of the maize LEC1 polynucleotide was analyzed by PCR 
using 100-200 ng template DNA in a 30 ml PCR reaction mixture containing 1X 
concentration enzyme buffer (10 mM Tris-HCI pH 8.8, 1.5 mM magnesium 
chloride, 50 mM potassium chloride, 0.1% Triton X-100), 200 pM dNTPs, 0.3 pM 

1 0 primers and 0.022 U TaqDNA polymerase (Boehringer Mannheim). 

Thermocycling conditions were as follows (30 cycles): denaturation at 95°C for 30 
s, annealing at 55°C for 1 min and extension at 72°C for 1 min. Primer sequences 
(F=forward; R=reverse) used were: (F) 5'-CGC TCT GTC ACC TGT TGT ACT C- 
3' (R) 5'-CGT GAT GAA GCT GAT GTA CTC C-3'. Approximate PCR product 

15 length was 620 bp. 

Results 

Following experiments to show increased regeneration capacity and 
improvement of maize transformation frequencies by expression of maize LEC1, 
20 the polynucleotide was then introduced into wheat scutellar and inflorescence 
explants, driven by the maize In2 promoter. Both tissues are used for wheat 
transformation. 

Subsequent to the induction of somatic embryos from both tissues after 
three weeks on a 2,4-D-containing induction medium, calluses were assessed 
25 prior to transfer onto shoot regeneration medium. Callus assessment involved: a) 
scoring calluses as 0 = non-embryogenic callus, 1 = 25%, 2 = 25-50%, 3 = 50- 
75%, 4 = 75-100% of callus surface embryogenic, and b) determining 
embryogenic capacity expressed in percentage as the number of embryogenic 
calluses / total number of calluses (scutella or inflorescence) assessed. 

30 

Scutellum calluses 

Mean callus scores of control (1.4 ± 0.3) and LEC-bombarded (1.4 ± 0.3) 
scutellar tissues of wheat line NH535 were not significantly different. However, 
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callus score of LEC-bombarded scutella (1.5 ± 0.5) of wheat line BO 014 was 
significantly improved in comparison with the control (0.5 ± 0.2). Similarly, 
embryogenic capacity of line NH535 did not seem to be affected by LEC 
treatment (LEC calluses = 84.3 ± 9.3%. control calluses = 90%). However, LEC- 
bombarded line BO 014 had clear increases in the embryogenic callus frequency 
(LEC calluses = 75.4 ± 16.8%, control calluses = 36.7 ± 4.7%). Examining the 
quality of embryogenic calluses formed, both lines showed significant increases in 
the number of 'good' calluses produced i.e. calluses with scores of 3 or 4. 'Good* 
quality callus of line NH535 increased from 5 to 22.3 % whilst line BO 014 
increased from 0 to 23.6%. These calluses were generally large, rapidly growing 
and vigorous. 

After callus induction and assessment, calluses were transferred onto 
shoot induction media for a total of six weeks. Shoot regeneration of calluses was 
determined, as the number of shoot regenerating calluses / total number of 
calluses assessed (expressed as percentages). Shoot regeneration of cultures 
corresponded with the quality and quantity of somatic embryos produced in each 
callus. Hence, regeneration of LEC-bombarded (71 .9 ± 12.1 ) and control (70 ± 
14) callus tissues of line NH535 were not significantly different. However, 
regeneration of LEC-bombarded calluses (52.3 ± 26.9) of wheat line BO 014 was 
significantly improved in comparison with the control (15.6 ± 6.3). 

To test the suitability of LEC as a positive selection system for wheat, 
sample tissues from vigorous calluses were analyzed for the presence of LEC 
sequences. Forty-one BO 014 and 13 NH535 calluses were selected. The 
results were that 10/41 BO 014 calluses and 8/13 NH535 were PCR positive. 
Thus, transformed lines were identified without selection at frequencies of 24.4% 
and 61.5%. These frequencies are comparable with conventional selection 
systems such as herbicide- and antibiotic-resistance systems (e.g. oar, npfll) 
applied in wheat transformation where selection 'escape' frequencies are 
commonly high and variable. Furthermore, we know of no other report of wheat 
transformation by morphological selection in the absence of a selection agent. 

Callus transformation frequencies were 5.6% and 4.4% in NH535 and BO 
014 lines, respectively. Transgenic plants were also recovered from LEC-positive 
callus lines. Seven non-clonal plants were recovered from NH535 and six non- 
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clonal plants were recovered from BO 014 to give plant transformation 
frequencies of 3.9 and 3.3%, respectively, based on the number of explants 
bombarded. 



5 Inflorescence calluses 

The use of inflorescence tissues as explants for the tissue culture and 
transformation of wheat offer several advantages over seed explants such as 
scutella (Rasco-Gaunt and Barcelo, 1999). However, responses of these tissues 
to culture are highly genotype-dependent and calluses are often non-regenerative 

10 despite having a 'highly-embryogenic' appearance. Hence, LEC was introduced 
into inflorescence tissues to see whether regeneration could be enhanced on a 
poorly regenerating line such BO 014. 

Using line BO 014, shoot regeneration was significantly improved in LEC- 
bombarded tissues, although callus quality appeared similar in bombarded and 

15 control tissues. Whereas no shoot was regenerated from control cultures, eight 
plants were regenerated from LEC-bombarded calluses to give a shoot 
regeneration frequency of 10.7%. 

Tables 

20 

Summary of tissue culture data 



Wheat 
line 


Treatment 


Mean 
callus 
score 
(0-4) 


Embryo. 
Capacity 

(%) 


% Good 
calluses 
(score 
3&4) 


Regeneration 
(%) 




NH535 


Control 


1.4 ±0.3 


90.0 


5 


70 ±14 




LEC1 


1.4 ±0.3 


84.3 ± 9.3 


22.3 


71.9 ±12.1 


BO 014 


Control 


0.5 ±0.2 


36.7 ±4.7 


0 


15.6 ±6.3 




LEC1 


1.5 ±0.5 


75.4 ± 
16.8 


23.6 


52.3 ±26.9 
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Type of callus produced per treatment per line 



Callus Score 


NH535 


BOI 


314 




Control 


LEC1 


Control 


LEC1 


0 


10 


15.7 


63.2 


25 


1 


45 


43.3 


26.3 


30 


2 


40 


28.7 


10.5 


21.4 


3 


5 


11.2 


0 


17.9 


4 


0 


1.1 


^ 0 


5.7 


Transformation Frequency 


Wheat line 


Callus line 


Plant line (non-clonal) 


NH535 


10/180(5.6%) 


7/180 (3.9%) 


BO 014 


8/180(4.4%) 


6/180(3.3%) 



10 

EXAMPLE 15 
Expression of Chimeric Genes in Dicot Cells 

The LEC1 polynucleotide can also be used to improve the transformation of 
1 5 soybean. To demonstrate this the construct consisting of the In2 promoter and 
LEC1 coding region were introduced into embryogenic suspension cultures of 
soybean by particle bombardment using essentially the methods described in 
Parrott, WA, LM. Hoffman, D.F. Hildebrand, E.G. Williams, and G.B. Collins, 
(1989) Recovery of primary transformants of soybean, Plant Cell Rep. 7:615-617. 
20 This method with modifications is described below. 

Seed was removed from pods when the cotyledons were between 3 and 5 
mm in length. The seeds were sterilized in a Chlorox solution (0.5%) for 15 
minutes after which time the seeds were rinsed with sterile distilled water. The 
immature cotyledons were excised by first cutting away the portion of the seed 
25 that contains the embryo axis. The cotyledons were then removed from the seed 
coat by gently pushing the distal end of the seed with the blunt end of the scalpel 
blade. The cotyledons were then placed (flat side up) SB1 initiation medium (MS 
salts, B5 vitamins, 20 mg/L 2,4-D, 31 .5 g/l sucrose, 8 g/L TC Agar, pH 5.8). The 
Petri plates were incubated in the light (16 hr day; 75-80 uE) at 26° C. After 4 
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weeks of incubation the cotyledons were transferred to fresh SB1 medium. After 
an additional two weeks, globular stage somatic embryos that exhibit proliferative 
areas were excised and transferred to FN Lite liquid medium (Samoylov, V.M., 
D.M. Tucker, and WA Parrott (1998) Soybean [Glycine max (L.) Merrill] 

5 embryogenic cultures: the role of sucrose and total nitrogen content on 

proliferation. In Vitro Cell Dev. Biol.- Plant 34:8-13). About 10 to 12 small clusters 
of somatic embryos were placed in 250 ml flasks containing 35 ml of SB172 
medium. The soybean embryogenic suspension cultures were maintained in 35 
mL liquid media on a rotary shaker, 150 rpm, at 26°C with florescent lights (20 nE) 

10 on a 16:8 hour day/night schedule. Cultures were sub-cultured every two weeks 
by inoculating approximately 35 mg of tissue into 35 mL of liquid medium. 

Soybean embryogenic suspension cultures were then be transformed using 
particle gun bombardment (Klein et al. (1987) Nature (London) 327:70, U.S. 
Patent No. 4,945,050). A BioRad Biolistic™ PDS1000/HE instrument was used for 

1 5 these transformations. A selectable marker gene which was used to facilitate 
soybean transformation is a chimeric gene composed of the 35S promoter from 
Cauliflower Mosaic Virus (Odell etal. (1985) Nature 373:810-812), the 
hygromycin phosphotransferase gene from plasmid pJR225 (from E. coli; Gritz et 
al. (1983) Gene 25:179-188) and the 3' region of the nopaline synthase gene from 

20 the T-DNA of the Ti plasmid of Agrobacterium tumefaciens. 

To 50 uL of a 60 mg/mL 1 urn gold particle suspension was added (in order): 
5 uL DNA (1 ug/fiL), 20 \l\ spermidine (0.1 M), and 50 uL CaCI 2 (2.5 M). The 
particle preparation was agitated for three minutes, spun in a microfuge for 1 0 
seconds and the supernatant removed. The DNA-coated particles were washed 

25 once in 400 u.L 70% ethanol and resuspended in 40 uL of anhydrous ethanol. 
The DNA/particle suspension was sonicated three times for one second each. 
Five uL of the DNA-coated gold particles were then loaded on each macro carrier 
disk. 

Approximately 300-400 mg of a two-week-old suspension culture was placed 
30 in an empty 60x1 5 mm petri dish and the residual liquid removed from the tissue 
with a pipette. Membrane rupture pressure was set at 1 100 psi and the chamber 
was evacuated to a vacuum of 28 inches mercury. The tissue was placed 
approximately 8 cm away from the retaining screen, and was bombarded three 
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times. Following bombardment, the tissue was divided in half and placed back 
into 35 ml of FN Lite medium. 

Five to seven days after bombardment, the liquid medium was exchanged 
with fresh medium. Eleven days post bombardment the medium was exchanged 
with fresh medium containing 50 mg/mL hygromycin. This selective medium was 
refreshed weekly. Seven to eight weeks post bombardment, green, transformed 
tissue was observed growing from untransformed, necrotic embryogenic clusters. 
Isolated green tissue was removed and inoculated into individual flasks to 
generate new, clonally propagated, transformed embryogenic suspension 
cultures. Each new line was treated as an independent transformation event. 
These suspensions were then subcultured and maintained as clusters of 
immature embryos, or tissue was regenerated into whole plants by maturation and 
germination of individual embryos. 

Two different genotypes were used in these experiments: 92B91 and 
93B82. Samples of tissue were either bombarded with the hygromycin resistance 
gene alone or with a 1:1 mixture of the hygromycin resistance gene and the LEC1 
construct. Embryogenic cultures generated from 92B91 generally produce 
transformation events while cultures from 93B82 are much more difficult to 
transform. For transformation experiments with 92B91 , approximately equal 
numbers of transformants were recovered from bombardments conducted with the 
LEC1 polynucleotide as without it. Twenty-nine transformants were recovered 
from the LEC1 -treated 92B91 tissue while 27 transformants were recovered from 
tissue receiving only the hygromycin resistance gene. In contrast, transformants 
were only recovered from 93B82 tissue receiving the LEC1 polynucleotide (none 
were recovered from the treatment using only the hygromycin resistance gene). 
Five transformants were recovered from 93B82 tissue bombarded with the LEC1 
polynucleotide while no transformants were recovered from tissue treated with 
only the hygromycin resistance gene. These results show that the LEC1 
polynucleotide will be very valuable for gene transfer to recalcitrant genotypes of 
soybean. 
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An isolated nucleic acid comprising a member selected from the group 
consisting of: 

(a) a polynucleotide which encodes a polypeptide of SEQ ID NO: 
2, 8, 10, 12,14, 16,18, 20, or 22; 

(b) a polynucleotide amplified from a plant nucleic acid library using 
the primers of SEQ ID NOS: 3 and 4, 5 and 6, 9 and 10, or 1 1 
and 12 or primers determined by using Vector nti Suite, InforMax 
Version 5. 

(c) a polynucleotide comprising at least 20 contiguous bases of 
SEQ ID NO: 1, 7, 9, 11, 13, 15, 17, 19, or 21; 

(d) a polynucleotide having a plant HAP3-type ccaat-box transcriptional 
activator, wherein the polynucleotide is from a plant other than 
Arabidopsisr, 

(e) a polynucleotide having at least 60% sequence identity to SEQ 
ID NO: 1 , 9, 1 1 , 13, 17, or 21 or 65% sequence identity to SEQ 
ID NO: 15 or 19 or 70% sequence identity to SEQ ID NO: 7, 
wherein the % sequence identity is based on the entire 
sequence and is determined by GAP analysis using Gap Weight 
of 50 and Length Weight of 3; 

(f) a polynucleotide comprising at least 25 nucleotides in length 
which hybridizes under high stringency conditions to a 
polynucleotide having the sequence set forth in SEQ ID NO: 
1,7, 9,11.13, 15,17,19, or 21; 

(g) a polynucleotide containing a conserved region coding the 
polypeptide of SEQ ID NO: 23, wherein the polynucleotide is 
from a plant other than Arabidopsis; 

(h) a polynucleotide having the sequence set forth in SEQ ID NO: 1 , 
7, 9,11,13, 15, 17, 19, or 21; and 

(i) a polynucleotide complementary to a polynucleotide of (a) 
through (h). 
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2. The isolated nucleic acid of claim 1 , wherein the transcriptional activator in 
(d) is from corn, soybean, wheat, rice, Veronia, or Argemone. 

3. The isolated nucleic acid of claim 1 wherein the polynucleotide in (f) is a 
5 functional equivalent. 

4. A vector comprising at least one nucleic acid of claim 1 . 

5. An expression cassette comprising at least one nucleic acid of claim 1 
10 operably linked to a promoter, wherein the nucleic acid is in sense or 

antisense orientation. 

6. The expression cassette of claim 5, wherein the nucleic acid is operably 
linked in antisense orientation to the promoter. 

15 

7. A host cell containing at least one expression cassette of claim 5. 

8. The host cell of claim 7 that is a plant cell. 

20 9. A transgenic plant comprising at least one expression cassette of claim 5. 

1 0. The transgenic plant of claim 9, wherein the plant is corn, soybean, 
sorghum, wheat, rice, alfalfa, sunflower, canola or cotton. 

25 11. A seed from the transgenic plant of claim 9. 

1 2. The seed from the transgenic plant of claim 1 0. 

1 3. An isolated protein comprising a member selected from the group 
30 consisting of: 

(a) a polypeptide comprising at least 25 contiguous amino acids of SEQ 
ID NO: 2, 8, 10, 12, 14, 16, 18, 20, or 22; 
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(b) a polypeptide which is a LEC1 transcriptional activator containing 
the conserved motifs of SEQ ID NO: 23, wherein the polypeptide is 
from a plant other than Arabidopsis, 

(c) a polypeptide comprising at least 60% sequence identity to SEQ ID 

5 NO: 2, 12, 14, 16, 20, or 22 or 70 % sequence identity to SEQ ID NO: 

8, 10, or 18, wherein the % sequence identity is based on the entire 
sequence and is determined by GAP analysis using Gap Weight of 12 
and Length Weight of 4; 

(d) a polypeptide encoded by a nucleic acid of claim 1 ; 

10 (e) a polypeptide encoded by a nucleic acid of SEQ ID NO: 1 , 7, 9, 1 1 , 

13, 15, 17, 19, or 21; and 
(f) a polypeptide having the sequence set forth in SEQ ID NO: 2, 8, 1 0, 
12, 14, 16, 18, 20, or 22. 



15 14. The protein of claim 1 3, wherein the transcriptional activator of (b) is a 
LEC1 protein from corn, soybeans, wheat, rice, Veronia, or Argemone. 

1 5. A ribonucleic acid sequence encoding a protein of claim 1 3. 

20 16. A method for modulating LEC1 activity in a plant, comprising: 

(a) transforming a plant cell with at least one expression cassette of 
claim 5 and 

(b) regenerating a transformed plant that expresses the at least one 
nucleic acid in an amount sufficient to modulate LEC1 activity in the 

25 plant. 

17. The method of claim 16, wherein the plant is corn, soybean, sorghum, 
wheat, rice, alfalfa, sunflower, canola or cotton. 

30 18. The method of claim 1 6, wherein LEC1 activity is increased. 

19. The method of claim 1 6, wherein LEC1 activity is decreased. 
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20. A method for transiently modulating the level of LEC1 activity in plant cells 
comprising introducing at least one nucleic acid of claim 1. 



21 . The method of claim 20 wherein the at least one nucleic acid is a 
5 ribonucleic acid. 

22. A method for transiently modulating the level of LEC1 activity in plant cells 
comprising introducing at least one polypeptide of claim 13. 

10 23. A method for enhancing tissue culture response in a plant cell comprising 
introducing into the plant cell at least one LEC1 polypeptide or at least one 
LEC1 polynucleotide under conditions sufficient to enhance tissue culture 
response. 

1 5 24. The method of claim 23 wherein the plant cell is transformed with at least 
one LEC1 polynucleotide. 

25. The method of claim 23 wherein at least one polynucleotide is operably 
linked to a promoter driving expression in the plant cell and growing the 

20 plant cell. 

26. The method of claim 23 wherein the plant cell is a recalcitrant cell. 

27. The method of claim 26 wherein the plant cell is an inbred plant cell. 

25 

28. A method for inducing somatic embryogenesis in a plant cell comprising 
introducing into a responsive plant cell at least one LEC1 polypeptide or at 
least one LEC1 polynucleotide, wherein the responsive plant cell is 
transformed and grown under conditions sufficient to stimulate the 

30 production of a transformed embryo and wherein the plant cell is other than 

an Arabidopsis cell. 
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29. The method of claim 28 wherein the plant cell is transformed with at least 
one LEC1 polynucleotide. 

30. The method of claim 29 wherein at least one polynucleotide is operably 
5 linked to a promoter driving expression in the plant cell. 

31 . The method of claim 28 further comprising growing the transformed embryo 
under plant growing conditions to produce a regenerated plant. 

10 32. The method of claim 28 wherein the plant cell is from corn, soybean, 
sorghum, wheat, rice, alfalfa, sunflower, canola or cotton. 

33. A plant produced by the method of claim 28. 

15 34. A method for positive selection of a transformed cell comprising introducing 
into a responsive plant cell at least one LEC1 polynucleotide or at least one 
LEC1 polypeptide and growing the transformed plant cell, wherein the 
responsive plant cell is transformed and grown under conditions sufficient 
to induce embryogenesis to provide a positive selection means. 

20 

35. The method of claim 34 wherein the plant cell is transformed with at least 
one polynucleotide. 

36. The method of claim 35 wherein at least one polynucleotide is operably 
25 linked to a promoter driving expression in a plant. 

37. The method of claim 34 further comprising altering media components to 
favor the growth of transformed plant cells. 

30 38. The method of claim 34 wherein the media components are altered to 
reduce somatic embryogenesis in non-transformed cells. 

39. The method of claim 36 wherein the polynucleotide is excised. 
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40. The method of claim 39 wherein the polynucleotide is flanked by FRT 
sequences to allow FLP mediated excision of the polynucleotide. 

41 . A method for inducing apomixis in a plant cell comprising introducing into a 
5 responsive plant cell at least one LEC1 polypeptide or at least one LEC1 

polynucleotide and growing the plant cell, wherein the introducing and 
growing is done under conditions sufficient to produce a transformed 
somatic embryo. 

10 42. The method of claim 41 wherein the plant cell is transformed with at least 
one LEC1 polynucleotide. 

43. The method of claim 44 wherein the at least one polynucleotide is operably 
linked to a promoter driving expression in a plant cell 

15 

44. The method of claim 41 further comprising suppressing in the plant cell the 
expression of an FIE polycomb polynucleotide using sense or antisense 
methods . 

20 45. The method of claim 41 further comprising growing the embryo under plant 
growing conditions to produce a regenerated plant. 

46. The method of claim 42 wherein the promoter is an inducible promoter. 

25 47. The method of claim 42 wherein the at least one polynucleotide is 
expressed in integument or nucellus tissue. 

48. A plant produced by the method of claim 41 , 

30 49. The plant of claim 48, wherein the plant is male sterile. 
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50. A method for increasing transformation efficiency comprising introducing at 
least one LEC1 polypeptide or at least one LEC1 polynucleotide into a 
responsive plant cell under conditions sufficient to increase transformation 
efficiency. 

5 

51 . The method of claim 50 wherein the transformation is conducted in medium 
that retards growth of somatic embryo growth in non-transformed plants. 

52. The method of claim 51 wherein transformation is conducted with reduced 
10 levels of auxin or no auxin. 

53. The method of claim 50 wherein the plant cell is transformed with at least 
one LEC1 polynucleotide. 

15 54. The method of claim 53 wherein the at least one polynucleotide is operably 
linked to a promoter driving expression in a plant cell 

55. The method of claim 50 wherein the plant cell is a recalcitrant cell. 

20 56. The method of claim 55 wherein the plant cell is an inbred cell. 

57. A method for increasing recovery of regenerated plants comprising 
introducing into a responsive plant cell at least one LEC1 polypeptide or at 
least one LEC1 polynucleotide and growing the plant cell, wherein the 

25 introducing and growing are under conditions sufficient to produce a 

regenerated plant. 

58. The method of claim 57 wherein the plant cell is transformed with at least 
one LEC1 polynucleotide. 



30 



59. The method of claim 58 wherein the at least one polynucleotide is operably 
linked to a promoter driving expression in a plant cell 
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60. The method of claim 57 wherein the plant cell is a recalcitrant cell. 

61 . The method of claim 60 wherein the plant cell is an inbred plant cell. 
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SEQUENCE LISTING 

<110> Pioneer Hi-Bred International, Inc. 
E# i. du Pont de Nemours * Co. 



DuPont (U.K.) Limited 

<120> Transcriptional Activator Nucleic Acids, 
Polypeptides, and Methods of Use Thereof 

<130> 0943-PCT 

<150> 60/107,643 
<151> 1998-11-09 

<160> 23 

<170> FastSEQ for Windows Version 3.0 

<210> 1 
<211> 1173 
<212> DNA 
<213> Zea mays 

<220> 

<221> CDS 

<222> (69) ... (902) 

ccacgcgtcTgccaccacac cacgagcgcg cgataaccct agctagcttc aggtagtagc 60 
gagagcca atg gac tec age age ttc etc cct gee gcc ggc gcg gag aat liu 
Met Asp Ser Ser Ser Phe Leu Pro Ala Ala Gly Ala Glu Asn 
1 5 1° 

ggc teg gcg gcg ggc ggc gcc aac aat ggc ggc get get cag cag cat 
Gly Ser Ala Ala Gly Gly Ala Asn Asn Gly Gly Ala Ala Gin Gin Hxs 
15 20 25 30 

gcg gcg ccg gcg ate cgc gag cag gac egg ctg atg ccg ate gcg aac 
Ma Ala Pro Ala lie Arg Glu Gin Asp Arg Leu Met Pro He Ala Asn 
35 40 45 

gtg ate cgc ate atg egg cgc gtg ctg ccg gcg cac gee aag ate teg 
Val lie Arg He Met Arg Arg Val Leu Pro Ala Has Ala Lys lie Ser 
50 55 60 

gac gac gcc aag gag acg ate cag gag tgc gtg teg gag tac ate age 
Lp Asp Ala Lys Glu Thr He Gin Glu Cys Val Ser Glu Tyr lie Ser 
65 70 75 

ttc ate acg ggg gag gcc aac gag egg tgc cag egg gag cag cgc aag 
Phe He Thr Gly Glu Ala Asn Glu Arg Cys Gin Arg Glu Gin Arg Lys 
80 85 90 

acc ate acc gcc gag gae gtg ctg tgg gcc atg age cgc etc ggc ttc 
Thr He Thr Ala Glu Asp Val Leu Trp Ala Met Ser Arg Leu Gly Phe 
95 100 105 11° 

gac gac tac gtc gag ccg etc ggc gcc tac etc cac cgc tac cgc gag 
Asp Asp Tyr Val Glu Pro Leu Gly Ala Tyr Leu His Arg Tyr Arg Glu 



158 



206 



254 



302 



350 



398 



446 
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115 



120 



125 



«. T i" K s K j- gj 2 c s s a s £ 

Phe Glu Gly Asp Ala Arg Gly Val w UQ 
130 

~ K S5 2 = K S S - 2 2 - s K ~ 

Ser Arg Gly Gly Asp his his rru g 
145 150 

£ £ £ S «3 S S £ 5 - = 2 S: r " S: = 

160 165 
cac cac cac cac ,ac atg cag atg cac gcc gcc atg tac ggg gga acg 
His His His His Asp Met Gin Met Hxs Ala Ala Met Tyr exy y 
175 "0 185 

s a £ s £ s a £ a i = a a a I a 

195 200 

s s £ = a a = = i a a a s a s = 

210 215 

«- ^ e arc ata get gca tac tat gga ggc gcc gcg 

S £ E S S £ S a L. L TV, Tvr Gly «. «. 

225 230 
tac gcg ccc ggc aac ggc ggg age ggc gac ggc agt ggc agt ggc „e 
Tyr Ala Pro Gly Asn Gly Gly Ser Gly Asp eiy y 
240 245 

„t «, 7 r « 2 s « s s a s a a a s 

Gly Gly Gly Ser Ala Ser His inr rto j 2?() 

255 260 

cac ccg cac ccg ttc gcg tac aag tagctagttc gtacgtcgtt cgacttgagc 
His Pro His Pro Phe Ala Tyr Lys 
275 

aagecatega tetgetgate tgaacgtacg ctgtattgta cacgcatgca egta^gtate J92 
ggeggctage tctcctgttt aagttgtact ^gattctgt ccegg gg tgccagtg 1112 
g?a?cttcct tcagtctcta gtttcttagc *9«gtagaa gtgttc g | aaaaaa u72 
tgttgtttta gggccggggt aaaccatccg atgagattat ttcaaa n73 
a 

<210> 2 
<211> 278 
<212> PRT 
<213> Zea mays 

„« £ 4 £ £ » «- «» Pro Ma M. sly - «. «. sly 
A. Ala tlI Sly M. » »» Sly sly «- «• «» «« 

.» u. n. a a. «• «p - M « E " Ile *" Val Ile 



542 



590 



63B 



686 



734 



782 



830 



878 



932 



35 40 
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Arg He Met Arg Arg Val Leu Pro Ala His Ala Lys He Ser Asp Asp 

55 60 



Ala Lys Glu Thr lie Gin Glu Cys Val Ser Glu Tyr He Ser Phe lie 

Thr Gly Glu Ala Asn II Arg cys Gin Arg Glu Gin Arg Lys Thr lie 

Thr Ala Glu Asp Val Leu Trp Ala Met Ser Arg Leu Gly Phe Asp Asp 

100 10-> 
Tyr Val Glu Pro Leu Gly Ala Tyr Leu His Arg Tyr Arg Glu Phe Glu 



Gly Asp Ma Arg Gly Val Gly Leu Val Pro Gly Ala Ala Pro Ser Arg 



115 120 
Ala Arg Gly Val Gly Leu 

Gly III Asp His His Pro His Ser Met Ser Pro Ala Ala Met Leu ly. 

Ser Arg Gly Pro Val Ser Gly Ala Ala Met Leu Pro His His His His 
165 1'° 
Gin Met His Ala Ala 
180 185 
Pro Pro Pro Ala Gly Pro Pro His His Gly Gly Phe Leu Met Pro His 



Pro Gin Gly Ser Ser His Tyr Leu Pro Tyr Ala Tyr Glu Pro Thr Tyr 

Gly III Glu His Ala Met Ma Ala Tyr Tyr Gly Gly Ala Ala Tyr Ala 

* 230 23b 

Pro Gly Asn Gly Gly Ser Gly Asp Gly Ser Gly Ser Gly Gly Gly Gly 

Gly Ser Ala Ser hII Thr Pro Gin Gly Ser Gly Gly Leu Glu His Pro 

260 265 
His Pro Phe Ala Tyr Lys 
275 

<210> 3 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 3 20 
tagtagcgag agccaatgga 

<210> 4 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 4 20 
gccgggacag aatcacagta 

<210> 5 
<211> 20 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> primer 



WO 00/28058 



4 



PCT/US99/26514 



<400> 5 20 
tagtagcgag agccaatgga 

<210> 6 
<211> 20 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> primer 

<400> 6 
cccggcccta aaacaacaca 



<210> 7 
<211> 481 
<212> DNA 

<213> Argemone mexicana 

<220> 

<221> CDS 

<222> (44) . . . (481) 

<221> misc_feature 
<222> (1)...(481) 
<223> n = A,T,C or G 



<400> 7 cc 
cgagagaaag agttggtgaa gaagaagaag aagttgaaaa gag atg gaa cgt ggt 



Met Glu Arg Gly 
1 



ggt ggt ggt ggt ggt agt ggt ggt ggt ttc cat gga tat cag aaa etc 
Gly Gly Gly Gly Gly Ser Gly Gly Gly Phe His Gly Tyr Gin Lys Leu 

10 15 20 



103 



5 



cca aaa tea aac tec get gga atg atg etc teg gag eta teg aat aac 151 
Pro Lys Ser Asn Ser Ala Gly Met Met Leu Ser Glu Leu Ser Asn Asn 
25 30 35 

aac aac aat att gac gta aac tct aca tgt act gta cga gag caa gat 
Asn Asn Asn lie Asp Val Asn Ser Thr Cys Thr Val Arg Glu Gin Asp 



199 



40 



45 50 



247 



cga tac atg cca att get aat gtg ate agg ate atg cgt aag gta ctt 
Ara Tyr Met Pro He Ala Asn Val lie Arg He Met Arg Lys Val Leu 
55 60 65 

cct act cat gec aag ate tct gac gat gee aaa gaa act ate caa gaa 295 
Pro Thr His Ala Lys lie Ser Asp Asp Ala Lys Glu Thr He Gin Glu 
70 75 80 

tgt gtc tea gaa tac ate agt ttc ate aca agt gaa gec aat gat cgt 343 
Cys Val Ser Glu Tyr He Ser Phe lie Thr Ser Glu Ala Asn Asp Arg 
85 90 95 1 

tqc caa cgt gaa caa aga aag aca ate aca get gaa gat gtt tta tgg 391 
Cys Gin Arg Glu Gin Arg Lys Thr lie Thr Ala Glu Asp Val Leu Trp 
* 105 HO H5 
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gcg atg age aaa eta ggg ntt gat gag tac att gaa C ct- eta act ctt 
Ala Met Ser Lys Leu Gly Xaa Asp Glu Tyr lie Glu Pro Leu Thr Leu 
120 125 13° 

tac ctt caa cgt tat cgt gag ttt gaa ggt gna cgt tgg tea 
Tyr Leu Gin Arg Tyr Arg Glu Phe Glu Gly Xaa Arg Trp Ser 
135 140 145 



439 



481 



<210> 8 
<211> 146 
<212> PRT 

<213> Argemone mexicana 



<220> 

<221> VARIANT 

<222> (U...U46) 

<223> Xaa « Any Amino Acid 



<400> 8 

Met Glu Arg Gly Gly Gly Gly Gly Gly Ser Gly Gly Gly Phe His Gly 

1 5 10 15 

Tyr Gin Lys Leu Pro Lys Ser Asn Ser Ala Gly Met Met Leu Ser Glu 

20 25 30 

Leu Ser Asn Asn Asn Asn Asn lie Asp Val Asn Ser Thr Cys Thr Val 

35 . 40 45 

Arq Glu Gin Asp Arg Tyr Met Pro He Ala Asn Val He Arg He Met 

50 55 60 

Arg Lys Val Leu Pro Thr His Ala Lys He Ser Asp Asp Ala Lys Glu 

65 70 75 ? 

Thr He Gin Glu Cys Val Ser Glu Tyr He Ser Phe He Thr Ser Glu 

85 90 95 

Ala Asn Asp Arg Cys Gin Arg Glu Gin Arg Lys Thr He Thr Ala Glu 

100 105 11° 

Asp Val Leu Trp Ala Met Ser Lys Leu Gly Xaa Asp Glu Tyr He Glu 

115 120 125 

Pro Leu Thr Leu Tyr Leu Gin Arg Tyr Arg Glu Phe Glu Gly Xaa Arg 

130 135 140 

Trp Ser 
145 



<210> 9 

<211> 942 

<212> DNA 

<213> Glycine max 

<220> 
<221> CDS 

<222> (3)... (722) .. 



gc acg age tct ctt ata ate aca cac aca cct ace tta ata get atg 
Thr Ser Ser Leu He He Thr His Thr Pro Thr Leu He Ala Met 
1 5 10 15 



95 
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143 

Val Ala 

40 " 45 



gag ttg aag ctg tea gtg tea gac atg aac atg agg «g cag gta gca 
Gly Leu Lys Leu Ser Val Ser Asp Met Asn Met Arg Gin Gin Val Ala 



35 

tea tea gat cac agt gca gee aca gga gag gag aac gaa tgc acg gtg 
Ser Ser Asp His sir Ma Ala Thr Gly Glu Glu Asn Glu Cys Thr Val 
50 55 

agg gag caa gac agg ttc atg cca ate gee aac gtg att agg ate atg 
Are Gil Gin Asp Arg Phe Met Pro lie Ala Asn Val He Arg lie Met 
65 70 75 

cgc aag att etc cct cca cac gca aaa ate teg gac gat gca aaa gaa 
Arg Lys He Leu Pro Pro His Ala Lys He Ser Asp Asp Ala Lys Glu 
80 85 90 95 

aca ate caa gag tgc gtg tct gag tac ate age ttc ate aca ggt gag 
Thr He Gin Glu Cys Val Ser Glu Tyr He Ser Phe He Thr Gly Glu 
100 105 HO 

gcg aac gag cgt tgc cag agg gag cag egg aag acc ata acc gca gag 
Ala Asn Glu Arg Cys Gin Arg Glu Gin Arg Lys Thr He Thr Ala Glu 
115 120 125 

gac gtg ctt tgg gec atg age aag ctt gga ttc gac gac tac ate gaa 
Asp Val Leu Trp Ala Met Ser Lys Leu Gly Phe Asp Asp Tyr He Glu 
130 135 14° 

ccg ttg acc atg tac ctt cac cgc tac cgt gaa ctt gag ggt gac cgc 
Pro Leu Thr Met Tyr Leu His Arg Tyr Arg Glu Leu Glu Gly Asp Arg 
145 150 155 

acc tct atg agg ggt gaa cca etc ggg aag agg act gtg gaa tac gee 
?hr Ser nil Arg 2y Glu Pro Leu Gly Lys Arg Thr Val Glu Tyr Ala 
160 165 170 175 

acg ctt ggt gtt get act get ttt gtc cct cca ccc tat cat cac cac 
Thr Leu Gly Val Ala Thr Ala Phe Val Pro Pro Pro Tyr His His His 
180 185 I 90 

aat ggg tac ttt ggt get gee atg ccc atg ggg act tac gtt agg gaa 
Zn Gly Tyr Phe 2y Ala Ala Met Pro Met Gly Thr Tyr Val Arg Glu 
195 200 205 

acg cca cca aat aca gec tec tec cat cac cac cac cac cac cac cac 
Ala Pro Pro Asn Thr Ala Ser Ser His His His His His His His His 
210 215 220 

cac cat get cgt gga ate tec aat get cat gaa cca aat get cgc tec 
His His Ala Arg Gly He Ser Asn Ala His Glu Pro Asn Ala Arg Ser 
225 230 235 



191 



239 



287 



335 



383 



431 



479 



527 



575 



623 



671 



719 



772 



ata taaaattata taattatgac taggattcag aacaagactt gatgatgatt 
He 
240 

agcttaactc tcagtaattg gtgctagagt actactgttg ttgaggatac "tattttat 832 
aattaaggge tgggaaggga gttagtatat tectaatcct aaetatgtgc atetttaatt 892 
tatgaaatca ctttgtttta acctttgatg aaaaaaaaaa aaaaaaaaaa 
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<210> 10 

<211> 240 
<212> PRT 
<213> Glycine max 

<400> 10 

Thr Ser Ser Leu He He Tbr His Thr Pro Thr Leu He Ala Met Glu 

1 5 10 15 

Thr Glv Gly Phe His Gly Tyr Arg Lys Leu Pro Asn Thr Thr Ala Gly 

20 25 30 

Leu Lys Leu Ser Val Ser Asp Met Asn Met Arg Gin Gin Val Ala Ser 

35 40 45 

Ser Asp His Ser Ala Ala Thr Gly Glu Glu Asn Glu Cys Thr Val Arg 

50 55 60 

Glu Gin Asp Arg Phe Met Pro He Ala Asn Val He Arg He Met Arg 
65 70 75 

Lvs He Leu Pro Pro His Ala Lys He Ser Asp Asp Ala Lys Glu Thr 

85 90 95 

He Gin Glu Cys Val Ser Glu Tyr He Ser Phe He Thr Gly Glu Ala 

100 105 11° 

Asn Glu Arg Cys Gin Arg Glu Gin Arg Lys Thr He Thr Ala Glu Asp 

115 120 125 

Val Leu Trp Ala Met Ser Lys Leu Gly Phe Asp Asp Tyr He Glu Pro 

130 135 14° 

Leu Thr Met Tyr Leu His Arg Tyr Arg Glu Leu Glu Gly Asp Arg Thr 
145 150 155 160 

Ser Met Arg Gly Glu Pro Leu Gly Lys Arg Thr Val Glu Tyr Ala Thr 

165 170 I' 5 

Leu Gly Val Ala Thr Ala Phe Val Pro Pro Pro Tyr His His His Asn 

180 185 190 

Glv Tyr Phe Gly Ala Ala Met Pro Met Gly Thr Tyr Val Arg Glu Ala 

195 200 205 

Pro Pro Asn Thr Ala Ser Ser His His His His His His His His His 

210 215 220 

His Ala Arg Gly He Ser Asn Ala His Glu Pro Asn Ala Arg Ser He 



<210> 11 
<211> 905 
<212> DNA 

<213> Veronia roespilifolia 

<220> 

<221> CDS 

<222> (58) . . . (699) 



gcacga^ccTatttctagag agagaacgag agagaattct ctaaagagga aaaatag atg 60 

1 



gaa cgt gga gga ggt ttc cat ggc tac cac agg etc ccc ate cac cct 
Glu Arg Gly Gly Gly Phe His Gly Tyr His Arg Leu Pro lie His Pro 

aca tct gga ate caa caa teg gat atg aag eta aag eta cca gaa atg 
Thr Ser Gly He Gin Gin Ser Asp Met Lys Leu Lys Leu Pro Glu Met 
20 25 30 



108 



156 
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acc aac aat aac teg tec act gat gac aat gag tgc ace gtt cga gaa 
Thr Asn Asn Asn Ser Ser Thr Asp Asp Asn Glu Cys Thr Val Arg Slu 
35 40 45 

cag gac cgc ttc atg ccg ata gca aac gtg ate cgc ate atg egg aag 
Gin Asp Arg Phe Met Pro He Ala Asn Val lie Arg lie Met Arg Lys 
50 55 60 « 

ate ctt cct cca cat gee aag ate tct gat gat gee aaa gag acg ate 
lie Leu Pro Pro His Ala Lys lie Ser Asp Asp Ala Lys Glu Thr He 
70 75 8 

caa aaa tgt gtt tea gag tac att age ttt gtc aca ggc gag gca aat 348 
Sn 111 Cys Val Ser 111 Tyr lie Ser Phe Val Thr Gly Glu Ala Asn 

90 95 



85 



gac cgc tgc cag cgt gag caa agg aag acc ate aca get gaa gat gtg 
Asp Arg Cys Gin Arg Glu Gin Arg Lys Thr lie Thr Ala Glu Asp Val 



100 



etc tgg get atg age aaa ctg gga ttt gat gat tat ate gag ccc ttg 
Leu Trp Ala Met Ser Lys Leu Gly Phe Asp Asp Tyr He Glu Pro Leu 

120 125 



115 



act gtg tat etc cat cgc tac agg gag ttt gat ggt ggc gaa cgt gga 
Thr Val Tyr Leu His Arg Tyr Arg Glu Phe Asp Gly Gly Glu Arg Gly 
130 " 135 140 145 



tec ata agg ggt gag ccc ctt gtg aag agg agt act tct gat cct ggt 
Ser III Arg G?y Glu Pro Leu Val Lys Arg Ser Thr Ser Asp Pro Gly 
~ a 155 160 



150 



cac ttt ggg atg get tct ttt gtg cct get ttt cat atg ggt cat cat 
Sis Ht HI Me? Ala Ser Phe Val Pro Ala Phe His Met Gly His His 

170 175 



165 



aac ggc ttc ttt ggt cct gca age att ggt ggt ttc ctg aaa gac cca 
Asn Gly Phe Phe Gly Pro Ala Ser lie Gly Gly Phe Leu Lys Asp Pro 

185 



180 



teg agt get ggc cct teg gga cct gca gtc get ggg ttt gag ccg tat 
Se? Ser Ala Gly Pro Ser Gly Pro Ala Val Ala Gly Phe Glu Pro Tyr 

200 205 



195 



204 



252 



300 



396 



444 



492 



540 



588 



636 



684 



739 



get cag tgt aaa gag taactgcaaa aagtaggggt tgggatgaga tgatgatgat 
Ala Gin Cys Lys Glu 
210 

ggtggtggtg gtggtggttt gttttgtttt gttctttctt ttttttttct tctttctttt 799 
StSJtSt? gaggaacaaa cttacattgg ttcaetttgg etaggcatgt aaacggttaa 859 
catgettate aagtagtagt tttcgatcaa aaaaaaaaaa aaaaaa 

<210> 12 
<211> 214 
<212> PRT 

<213> Veronia mespilifolia 

Met Glu^g Gly Gly Gly Phe His Gly Tyr His Arg Leu Pro lie His 
1 5 10 15 
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Pro Thr Ser Gly He Gin Gin Ser Asp Met Lys Leu Lys Jju Pro Glu 

20 25 
Met Thr Asn Asn Asn Ser Ser Thr Asp Asp Asn Glu Cys Thr Val Arg 

35 40 
Glu Gin Asp Arg Phe Met Pro lie Ala Asn Val lie Arg He Met Arg 

so 55 60 

Lys lie Leu Pro Pro His Ala Lys He Ser Asp Asp Ala Lys Glu Thr 

70 75 
lie Gin Glu Cys Val Ser Glu Tyr He Ser Phe Val Thr Gly Glu Ala 

85 90 95 

Asn Asp Arg Cys Gin Arg Glu Gin Arg Lys Thr lie Thr Ala Glu Asp 

100 105 110 

Val Leu Trp Ala Met Ser Lys Leu Gly Phe Asp Asp Tyr lie Glu Pro 

115 120 I 25 

Leu Thr Val Tyr Leu His Arg Tyr Arg Glu Phe Asp Gly Gly Glu Arg 

130 135 140 

Gly Ser He Arg Gly Glu Pro Leu Val Lys Arg Ser Thr Ser Asp Pro 
145 150 I 5 * 1 

Glv His Phe Gly Met Ala Ser Phe Val Pro Ala Phe His Met Gly His 

y 165 170 175 

His Asn Gly Phe Phe Gly Pro Ala Ser lie Gly Gly Phe Leu Lys Asp 

180 i85 190 

Pro Ser Ser Ala Gly Pro Ser Gly Pro Ala Val Ala Gly Phe Glu Pro 

195 200 205 

Tyr Ala Gin Cys Lys Glu 
210 

<210> 13 

<211> 763 

<212> DNA 

<213> Zea mays 

<220> 

<221> CDS 

<222> (1) ... (480) 

<400> 13 4ft 
oca cga ggc aag acc gtc acc tec gag gac ate gtg tgg gee atg age 48 
Sa Sg Gly Lys Thr Val Thr Ser Glu Asp lie Val Trp Ala Met Ser 
I 5 10 

cgc etc ggc ttc gac gac tac gtc gcg ccc etc ggc gee ttc etc cag 
Arg Leu Gly Phe Asp Asp Tyr Val Ala Pro Leu Gly Ala Phe Leu Gin 
20 25 

cac atg cgc gac gac age gac cac ggc ggt gaa gag cgc ggc ggc cct 
Arg Me? Arg Asp Asp Ser Asp His Gly Gly Glu Glu Arg Gly Gly Pro 
35 40 45 

gca ggg cgt ggt ggc teg cgc cgc ggc teg teg tec ttg ccg etc cac 
Ala Gly Arg Gly Gly Ser Arg Arg Gly Ser Ser Ser Leu Pro Leu His 
50 55 60 

tgc ccg cag cag atg cac cac ctg cac cca gee gtc tgc egg cgt ccg 
Cys Pro Gin Gin Met His His Leu His Pro Ala Val Cys Arg Arg Pro 
65 70 75 80 

cac cag age gtg teg cct get gca gga tac gee gtc egg ccc gtt ccc 
Ss Gin Ser Val Ser Pro Ala Ala Gly Tyr Ala Val Arg Pro Val Pro 
85 9° 95 



96 



144 



192 



240 



288 
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cac eca ato eca gcc cgt ggg tac cgc atg cag ggc gga gac cac cgc 
2g III £t Pro Ala Arg gj T yr Arg Met Gin Gly Gly Asp His Arg 
* 105 i10 



100 



aac ctg ggc ggc gtg get ccc tgc age tac gga ggg gee etc gtc cag 
Ser ?al Sy Gly Val Ala Pro Cys Ser Tyr Gly Gly Ala Leu Val Gin 

- ~ ** 123 



115 120 



gcc ggt gga acc caa cac gtt gtt gga ttc cac gac gac gag gca age 
Ala Gly Gly Thr Gin His Val Val Gly Phe His Asp Asp Glu Ala Ser 



130 



135 140 



tct tea agt gaa aat ccg ccg ccg gag ggg cgt gcc get ggc teg aac 
Ser Ser Ser Glu Asn Pro Pro Pro Glu Gly Arg Ala Ala Gly Ser Asn 
145 150 155 

taacctagct tctcagttcc ccgtgtacaa taagaggggc ggtcgcggcg ccgcgccgcg 
cccttgggtt gggccgggcg etatgetgea gtttggtttg taaactaacg agectagggt 
agctggtgca cgcgcgccac ctegceggae gtcgccgtcg tegteggcat g?acttaacc 
ggegggeect gttgttattt ctcaagtttg tagccaacgc actgttcggt gcgttccata 
atttaattta ccatgttgct ctcgaaaaaa aaaaaaaaaa aaa 



336 



384 



432 



480 



540 
600 
660 
720 
763 



<210> 14 
<211> 160 
<212> PRT 
<213> Zea mays 



<400> 14 

= Thr val Thr Ser Glu Asp He Val Trp Ala Met Ser 

10 15 
Tyr Val Ala Pro Leu Gly Ala Phe Leu Gin 

25 30 
Asp His Gly Gly Glu Glu Arg Gly Gly Pro 

40 45 
Arg Arg Gly Ser Ser Ser Leu Pro Leu His 
55 ~ 60 

His Leu His Pro Ala Val Cys Arg Arg Pro 

75 80 
Ala Ala Gly Tyr Ala Val Arg Pro Val Pro 

90 95 
Gly Tyr Arg Met Gin Gly Gly Asp His Arg 

105 HO 
Pro Cys Ser Tyr Gly Gly Ala Leu Val Gin 

120 " 125 

Val Val Gly Phe His Asp Asp Glu Ala Ser 
135 140 

Pro Pro Glu Gly Arg Ala Ala Gly Ser Asn 
155 160 

<210> 15 
<211> 622 
<212> DNA 
<213> Zea mays 



Ala Arg 


Gly Lys Thr 


Val 


1 




5 




Arg 


Leu 


Gly Phe Asp Asp 




20 




Arg Met 


Arg Asp Asp 


Ser 






35 




Ala 


Gly 


Arg Gly Gly 


Ser 




50 






Cys 


Pro 


Gin Gin Met 


His 


65 






70 


His 


Gin 


Ser Val Ser 


Pro 






85 




Arg 


Pro 


Met Pro Ala Arg 




100 




Ser 


Val 


Gly Gly Val 


Ala 






115 




Ala 


Gly 


Gly Thr Gin 


His 




130 






Ser 


Ser 


Ser Glu Asn 


Pro 


145 






150 



<220> 

<221> CDS 

<222> (3)... (622) 

<221> misc feature 
<222> (1).7. (622) 
<223> n = A,T,C or G 
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gc .Jmj! ccc caa aac cct aaa gcc agt get cct tgc acc ttg 47 
gc atg aac aat ^ _ _ cor in a t>~« r*#* Thr Leu 



Met Asn Asn Pro Gin Aan Pro Lys Ala Ser Ala Pro Cys Thr Leu 
5 10 is 



cca ccg gag ctt ccc aaa gaa gca gtg gcg acc gac gaa gca ccg ccg 
Pro Pro Glu Leu Pro Lys Glu Ala Val Ala Thr Asp Glu Ala Pro Pro 
20 25 30 

cca atg ggc aac aac aac aac acg gaa teg gcg acg gcg acg atg gtc 
Pro Me? lly Asn Asn Asn Asn Thr Glu Ser Ala Thr Ala Thr Met Val 
35 40 45 



egg gag cag gac egg ctg atg ccc gtg gcc aac gtg tec cgc ate atg 
Arg Glu Gin Asp Arg Leu Met Pro Val Ala Asn Val Ser Arg He Met 
50 55 60 

cgc caa gtg ctg cct ccg tac gee aag ate tec gac gac gee can gaa 
Arg Gin Val Leu Pro Pro Tyr Ala Lys He Ser Asp Asp Ala Xaa Glu 
65 70 75 

gtn ate caa gaa ttg ctn ttc gga att tea tea ctt ncg tec tgg cga 
Xaa He Gin Glu Leu Xaa Phe Gly lie Ser Ser Leu Xaa Ser Trp Arg 



80 



85 90 95 



ggc gaa acg aag egg tgc cac acc gag cgc cgc aag acc gtc acc tec 
III Glu Thr Lys Arg Cys His Thr Glu Arg Arg Lys Thr Val Thr Ser 
100 1° 5 110 

oaa aac ate gtg tgg gcc atg age cgc etc ggc ttc gac gac tac gtc 
111 Asp lit Val Tr? Ala Mel Ser Arg Leu Gly Phe Asp Asp Tyr Val 
115 120 125 

gcg ccc etc ggc gcc ttc etc cag cgc atg cgc gac nac age gaa cac 
Ala Pro Leu Gly Ala Phe Leu Gin Arg Met Arg Asp Xaa Ser Glu His 
130 135 140 

ggg ggt gaa aac gcg gcg gcc tgc ang ggg tng tgg ten cgc cgc ggg 
Glv Gly Glu Asn Ala Ala Ala Cys Xaa Gly Xaa Trp Xaa Arg Arg Gly 
145 150 "5 

teg tct net tgg cgc tec ctt gee gca ana gat gac aac ttg cac caa 
Ser Ser Xaa Trp Arg Ser Leu Ala Ala Xaa Asp Asp Asn Leu Hxs Gin 
160 165 170 175 

aca tct acc ggg ntc gga cca aaa ctn ttc cct gtt gca gga ata ccc 
HI sir HI HI Xaa Sy Pro Lys Xaa Phe Pro Val Ala Gly lie Pro 
180 185 190 



gtc cng ggc cnt tec ccc ccn aat cca acc att tgg ttt ccc ctt gc 
Val Xaa Gly Xaa Ser Pro Xaa Asn Pro Thr lie Trp Phe Pro Leu 
195 200 205 



95 



143 



191 



239 



287 



335 



383 



431 



479 



527 



575 



622 



<210> 16 
<211> 206 
<212> PRT 
<213> Zea mays 

<220> 
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<221> VARIANT 

<222> (!)••• (206) 

<223> Xaa = Any Amino Acid 

Met Asn'^n Pro Gin Asn Pro Lys Ala Ser Ala Pro Cys Thr Leu Pro 

Pro Glu Leu Pro Lys Glu Ala Val Ala Thr Asp Glu Ala Pro Pro Pro 

20 25 30 

Met Gly Asn Asn Asn Asn Thr Glu Ser Ala Thr Ala Thr Met Val Arg 

35 40 45 

Glu Gin Asp Arg Leu Met Pro Val Ala Asn Val Ser Arg lie Met Arg 

50 5$ ^ 

Gin Val Leu Pro Pro Tyr Ala Lys He Ser Asp Asp Ala Xaa Glu Xaa 

65 70 75 

lie Gin Glu Leu Xaa Phe Gly He Ser Ser Leu Xaa Ser Trp Arg Gly 

85 90 95 

Glu Thr Lys Arg Cys His Thr Glu Arg Arg Lys Thr Val Thr Ser Glu 

100 105 11° 

Asp He Val Trp Ala Met Ser Arg Leu Gly Phe Asp Asp Tyr Val Ala 

115 120 125 

Pro Leu Gly Ala Phe Leu Gin Arg Met Arg Asp Xaa Ser Glu His Gly 

130 135 i4 ° 

Gly Glu Asn Ala Ala Ala Cys Xaa Gly Xaa Trp Xaa Arg Arg Gly Ser 
145 150 155 loU 

Ser Xaa Trp Arg Ser Leu Ala Ala Xaa Asp Asp Asn Leu His Gin Thr 

165 170 I 73 

Ser Ala Gly Xaa Gly Pro Lys Xaa Phe Pro Val Ala Gly lie Pro Val 

180 185 I 90 

Xaa Gly Xaa Ser Pro Xaa Asn Pro Thr He Trp Phe Pro Leu 
195 200 205 



<210> 17 
<211> 1121 
<212> DNA 
<213> Glycine max 

<220> 

<221> CDS 

<222> (3) . . . (1121) 



<400> 17 47 

ac aca agg gaa act gga ggc ttt cat ggc tac cgc aag etc ccc aac 

9 Thr Arg Glu Thr lly Gly Phe His Gly Tyr Arg Lys Leu Pro Asn 
1 5 10 15 

aca ace tct ggg ttg aag ctg tea gtg tea gac atg aac atg aac atg 95 
Thr Thr Ser Gly Leu Lys Leu Ser Val Ser Asp Met Asn Met Asn Met 

25 30 



20 



aoa cag cag cag gta gca tea tea gat cag aac tgc age aac cac agt 143 
Arg III 111 Gin Val Ala Ser Ser Asp Gin Asn Cys Ser Asn Hrs Ser 
35 40 45 

gca gca gga gag gag aac gaa tgc acg gtg agg gag caa gac agg ttc 191 
Ala Ala Gly Glu Glu Asn Glu Cys Thr Val Arg Glu Gin Asp Arg Phe 
50 55 60 

ato cca ate get aac gtg ata egg ate atg cgc aag att etc cct cca 239 
Me? Pro lie Ala Asn Val He Arg He Met Arg Lys lie Leu Pro Pro 
65 70 75 
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aa ate tec oat oat oca aag gag aca ate caa gag tgc gtg 
Z E Lys S 2 5 £ S. Iflf «« Thr lie Cin Glu Cys Val 
80 85 90 

tea aaq tac ate age ttc ate ace ggg gag gee aac gag cgt tgc cag 
Ser £u Tyr He Ser Phe lie Thr Gly Glu Ala Asn Glu Arg Cys Gin 
100 105 

aoa aag cag cgc aag ace ata ace gca gag gac gtg ctt tgg gca atg 
Arg £u SS Arg Lys Thr lie Thr Ala Glu Asp Val Leu Trp Ala Met 
115 120 " s 



a«t aaa ctt aga ttc gac gac tac ate gaa ccg tta ace atg tac ctt 
Ser Lys Leu Sy Phe Asp Asp Tyr lie Glu Pro Leu Thr Met Tyr Leu 
' 130 135 I 40 



cac cqc tac cgt gag ctg gag ggt gac cgc ace tct atg agg ggt gaa 
Sis Aca Tyr Arg Glu Leu Glu Gly Asp Arg Thr Ser Met Arg Gly Glu 
145 150 I 55 

cca etc ggg aag agg act gtg gaa tat gee acg ctt get act get ttt 
Pro Leu 111 Lys Arg Thr Val Glu Tyr Ala Thr Leu Ala Thr Ala Phe 
160 * * I 65 170 

gtg ccg cca ecc ttt cat cac cac aat ggc tac ttt ggt get gee atg 
Val Pro Pro Pro Phe His His His Asn Gly Tyr Phe Gly Ala Ala Met 
180 185 190 

ecc atg ggg act tac gtt agg gaa acg cca cca aat get gcg tea tct 
Pro Me? ITy Thr Tyr Val Arg Glu Thr Pro Pro Asn Ala Ala Ser Ser 

200 205 



195 



rat cac cat cat gga ate tec aat get cat gaa cca aat get cgc tec 
As Ss £s His 2y He Ser Asn Ala His Glu Pro Asn Ala Arg Ser 

215 220 



210 



ata taa aat taa tga aga gta ctg ttc agt agg aga aca aga ctt ctt 
Ae * Asn * * Arg Val Leu Phe Ser Arg Arg Thr Arg Leu Leu 
225 230 235 

aoa ctt gat tag ctt aac tct cag tga ttg gtg tta gag tac tgt tgt 
Gly Leu £p * 9 Leu Asn Ser Gin * Leu Val Leu Glu Tyr Cys Cys 



240 



ata tat tec taa tec taa tta tgt gca tct tta att tat gga ata act 
lie Jyr Ser * Ser * Leu Cys Ala Ser Leu lie Tyr Gly lie Thr 
265 270 

tta ttt ttt gtt tta act tct gat aat ttg gat ttt ctg atg ttt aat 
HI Si !£ 55 Thr Ser Lp Asn Leu Asp Phe Leu Met Phe Asn 

280 285 290 

gtg gtt ttg tct ate cct tat taa cag tgc caa get taa ggt ttt age 
Sal Val Leu Ser lie Pro Tyr * Gin Cys Gin Ala * Gly Phe Ser 
295 300 305 



287 



335 



383 



431 



479 



527 



575 



623 



671 



719 



767 



tga gga tgg tta att tta taa tta agg get ggg aat tgg gga gtt agt 
* Gly Trp Leu He Leu * Leu Arg Ala Gly Asn Trp Gly Val Ser 
255 2*0 



863 



911 



959 
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cat get cca aaa tgg aat act tgt act gtt atg ttg ttc tgg tag tga 
His Ala Pro Lys Trp Asn Thr Cys Thr Val Met Leu Phe Trp 
310 315 

taa taa taa aac ctg caa gtt atg ttt atg tat aaa gec act att gat 
tgg tga tga aac oj, ^ ^ ^g ^ ^ Tyr Lys ^ ^ ^ 

F — - 330 335 



1007 



1055 



1103 



caa aat tag aga aat tat cat tta ata agt ate etc eca tgt taa ttt 
Gin Asn * Arg Asn Tyr His Leu lie Ser He Leu Pro Cys Phe 
340 345 

1121 

taa aaa aaa aaa aaa aaa 
* Lys Lys Lys Lys Lys 
355 



<210> 18 

<211> 355 

<212> PRT 

<213> Glycine max 

Thr Arg^lu Thr Gly Gly Phe His Gly Tyr Arg Lys Leu Pro Asn Thr 

Thr Ser Gly Leu Lys Leu Ser Val Ser Asp Met Asn Met Asn Met Arg 

Gin Gin Gin Val Ala Ser Ser Asp Gin Asn Cys Ser Asn His Ser Ala 

35 40 45 

Ala Gly Glu Glu Asn Glu Cys Thr Val Arg Glu Gin Asp Arg Phe Met 
50 55 60 

*~ w.i He Arg He Met Arg . 

65 70 75 80 



Pro lie Ala Asn Val lie Arg lie Met Arg Lys He Leu Pro Pro His 
65 70 75 80 

Ala Lys lie Ser Asp Asp Ala Lys Glu Thr lie Gin Glu Cys Val Ser 



Leu Gly Lys Arg Thr III Glu Tyr Ala Thr Leu Ala Thr Ala Phe Val 



90 95 
Glu Tyr lie Ser Phe lie Thr Gly Glu Ala Asn Glu Arg Cys Gin Arg 

100 1° 5 11 

Glu Gin Arg Lys Thr lie Thr Ala Glu Asp Val Leu Trp Ala Met Ser 

115 120 J- 25 

Lys Leu Gly Phe Asp Asp Tyr He Glu Pro Leu Thr Met Tyr Leu His 

130 140 
Arg Tyr Arg Glu Leu Glu Gly Asp Arg Thr Ser Met Arg Gly Glu Pro 

"* J. DO 

Leu Ala Thr Ala Phe 
165 170 175 

Pro Pro Pro Phe His His His Asn Gly Tyr Phe Gly Ala Ala Met Pro 

180 185 
Met Gly Thr Tyr Val Arg Glu Thr Pro Pro Asn Ala Ala Ser Ser His 

195 200 
His His His Gly He Ser Asn Ala His Glu Pro Asn Ala Arg Ser lie 

210 215 220 

Asn Arg Val Leu Phe Ser Arg Arg Thr Arg Leu Leu Gly Leu Asp Leu 
225 230 235 c ™ 

Asn Ser Gin Leu Val Leu Glu Tyr Cys Cys Gly Trp Leu lie Leu Leu 

245 250 " 3 

Arg Ala Gly Asn Trp Gly Val Ser lie Tyr Ser Ser Leu Cys Ala Ser 

260 265 270 

Leu lie Tyr Gly lie Thr Leu Phe Phe Val Leu Thr Ser Asp Asn Leu 

275 280 285 

Asp Phe Leu Met Phe Asn Val Val Leu Ser lie Pro Tyr Gin Cys Gin 
290 295 300 
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Ala Gly Phe Ser His Ala Pro Lys Trp Asn Thr Cys Thr Val Met Leu 

305 310 315 320 

Phe Trp Trp Asn Leu Gin Val Met Phe Met Tyr Lys Ala Thr lie Asp 

325 330 33 5 

Gin Asn Arg Asn Tyr His Leu lie Ser lie Leu Pro Cys Phe Lys Lys 

340 345 «0 



Lys Lys Lys 
355 

<210> 19 

<211> 796 

<212> DNA 

<213> Glycine max 

<220> 

<221> CDS 

<222> (1)...{513) 

<400> 19 . .„ 

gca cga gca atg gcg gga gtg agg gaa cag gac cag tac atg ccg ata 
Ala Arg Ala Met Ala Gly Val Arg Glu Gin Asp Gin Tyr Met Pro He 
! ' 5 10 15 

gcg aac gtg ata agg ate atg cgt egg att ctg cca gcg cac gcg aag 
Ala Asn Val He Arg He Met Arg Arg He Leu Pro Ala Hrs Ala Lys 
20 25 30 



ate tea gac gac gcg aag gag acg ate cag gag tgc gtg tct gag tac 
He Ser Asp Asp Ala Lys Glu Thr He Gin Glu Cys Val Ser Glu Tyr 
35 40 45 

ate agt ttc ate acg gcg gag gcg aac gag egg tgc cag egg gag cag 
He Ser Phe He Thr Ala Glu Ala Asn Glu Arg Cys Gin Arg Glu Gin 
50 55 60 

egg aag acg gtg acc gca gag gat gtg ttg tgg gcg atg gag aag ctt 
Arg Lys Thr Val Thr Ala Glu Asp Val Leu Trp Ala Met Glu Lys Leu 

qgc ttt gac aac tac get cac cct etc tct ctt tac ctt cac cgc tac 
Gly Phe Asp Asn Tyr Ala His Pro Leu Ser Leu Tyr Leu His Arg Tyr 
85 90 95 

cgc gag agt gaa gga gaa cct get tct gtc aga cgc get tct tct gca 
Are Glu Ser Glu Gly Glu Pro Ala Ser Val Arg Arg Ala Ser Ser Ala 
100 105 11° 

atg ggg ate aat aat aat atg gtg cac cca cct tat att aat tct cat 
Met Gly He Asn Asn Asn Met Val His Pro Pro Tyr He Asn Ser Hrs 



115 



120 125 



ggc ttt gga atg ttt gat ttt gac cca tea teg caa ggg ttt tac agg 
Glv Phe Gly Met Phe Asp Phe Asp Pro Ser Ser Gin Gly Phe Tyr Arg 
130 135 140 

gac gat cat aac get get tct gga tct ggt ggt ttt gtt gcg cct ttt 
Asp Asp His Asn Ala Ala Ser Gly Ser Gly Gly Phe Val Ala Pro Phe 
145 150 155 



96 



144 



192 



240 



288 



336 



384 



432 



480 
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gat cct tat get aac ate aaa egt gat gec ctg tgatcatgta agaacaacaa 533 
Asp Pro Tyr Ala Asn He Lys Arg Asp Ala Leu 

ctagtgcatg ctgettttte acttggttag ttatattcaa gcacaagcac a^ggtgc 593 

aoctacaact atttagcttc atctacaaat ccttttcccc ctc^tou J\ 713 

SSJtagag acaatacttg ttattcattg ttatgctcaa ttgetagett ctattcatcg 713 

tcgactgtct gtattgttga tgttcattac agtaacagat aagatggtaa ctgetttact ^ 
acttcaaaaa aaaaaaaaaa aaa 

<210> 20 

<211> 171 

<212> PRT 

<213> Glycine max 

Ala Arg^a Met Ala Gly Val Arg Glu Gin Asp Gin Tyr Met Pro lie 

1 5 10 15 

Ala Asn Val He Arg lie Met Arg Arg lie Leu Pro Ala His Ala Lys 

20 25 

lie Ser Asp Asp Ala Lys Glu Thr lie Gin Glu Cys Val Ser Glu Tyr 

35 40 
He Ser Phe He Thr Ala Glu Ala Asn Glu Arg Cys Gin Arg Glu Gin 

50 55 60 

Arg Lys Thr Val Thr Ala Glu Asp Val Leu Trp Ala Met Glu Lys Leu 
65 9 70 75 

„ PhP Asd Asn Tyr Ala His Pro Leu Se 
85 90 



Gly Phe Asp Asn Tyr Ala His Pro Leu Ser Leu Tyr Leu His Arg Tyr 

85 90 5,3 

Arg Glu Ser Glu Gly Glu Pro Ala Ser Val Arg Arg Ala Ser Ser Ala 

100 105 110 

Met Gly He Asn Asn Asn Met Val His Pro Pro Tyr lie Asn Ser Hrs 

115 120 
Gly Phe Gly Met Phe Asp Phe Asp Pro Ser Ser Gin Gly Phe Tyr Arg 

130 135 140 

Asp Asp His Asn Ala Ala Ser Gly Ser Gly Gly Phe Val Ala Pro Phe 
145 150 155 

Asp Pro Tyr Ala Asn He Lys Arg Asp Ala Leu 
165 1™ 

<210> 21 
<211> 1098 
<212> DNA 

<213> Triticuro aestivum 

<220> 

<221> CDS 

<222> (55) ... (894) 



gcacgagcaa gtgcgagtgc gactacctgc attgeacett ggctagccct agac atg 

1 

aaa aac gac ggc gtc ccc aac gga cca gcg gcg ccg gca cct ace cag 
III Lp Gly Val Pro Asn Gly Pro Ala Ala Pro Ala Pro Thr Gin 
5 10 15 

ggg acg ccg gtg gtg egg gag cag gac egg ctg atg ccg ate gcg aac 
Gly Thr Pro Val Val Arg Glu Gin Asp Arg Leu Met Pro He Ala Asn 
20 25 30 



105 



153 
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gtg ate cgc ate atg cgc cgt gcg etc cct «ec cac gee aag ate tec 
Val He Arg lie Met Arg Arg Ala Leu Pro Ala His Ala Lys He Ser 
35 40 45 

aac aac ace aag gag gcg att cag gaa tgc gtg tec gag ttc ate age 
Asp S Sa Lys Glu Ala He Gin Glu Cys Val Ser Glu Phe He Ser 
50 55 60 

ttc gtc aec ggc gag gec aac gaa egg tgc cgc atg cag cac cgc aag 
Phe Val Thr Gly Glu Ala Asn Glu Arg Cys Arg Met Gin His Arg Lys 
70 75 80 

aec gtc aac gee gaa gae ate gtg tgg gec eta aac cgc etc ggc ttc 
Thr Val Asn Ala Glu Asp He Val Trp Ala Leu Asn Arg Leu Gly Phe 
85 90 95 

gae gae tac gtc gtg ccc etc age gtc ttc ctg cac cgc atg cgc gac 
Asp Asp Tyr Val Val Pro Leu Ser Val Phe Leu His Arg Met Arg Asp 
P 100 105 H° 

ccc gag gcg ggg aca ggt ggt gec get gca ggc gac age cgc gec gtg 
Pro 111 Ala Gly Thr Gly Gly Ala Ala Ala Gly Asp Ser Arg Ala Val 



115 



120 125 



acg agt gcg cct ccc cgc gcg gee ccg ccc gtg ate cac gec gtg ccg 
JS Ser 111 Pro Pro Arg Ala Ala Pro Pro Val He His Ala Val Pro 
130 135 140 145 

ctg cag get cag cgc ccg atg tac gcg ccc ccg get ccg ttg cag gtt 
Leu Gin Ala Gin Arg Pro Met Tyr Ala Pro Pro Ala Pro Leu Gin Val 
150 155 160 

gag aat cag atg cag egg cct gtg tac get ccc ccg get ccg gtg cag 
Glu Asn Gin Met Gin Arg Pro Val Tyr Ala Pro Pro Ala Pro Val Gin 
165 170 175 

att cag atg cag egg ggc ate tat ggg ccc egg get cca gtg cac ggg 
?5 S Me? Gin Arg Gly He Tyr Gly Pro Arg Ala Pro Val His Gly 
180 185 190 

t ac ace ate gga atg gcg ccc gtg egg gec aac gtc ggc ggg cag tac 
Tyr Ma ill lly Me? Ill Pro Val Arg Ala Asn Val Gly Gly Gin Tyr 
195 " 200 205 

ata t tc age gga gag ggt gtc atg gec cag caa tac tac ggg tac 
111 III Pne Sly Sy Glu Gly Val Met Ala Gin Gin Tyr Tyr Gly Tyr 
210 215 220 225 



ggg tac gag gaa gga gcg tac ggc gca ggt age age aac gga gga gec 
Gly Tyr Glu Glu Gly Ala Tyr Gly Ala Gly Ser Ser Asn Gly Gly Ala 



230 



aec att ggc gac gag gag age teg tec aac ggc gtg ccg gca ccg ggg 
£ J" Gly Asp Glu Glu Ser Ser Ser Asn Gly Val Pro Ala Pro Gly 
245 250 255 

gag ggc atg ggg gag cca gag cca gag cca gca gca gaa gaa teg cat 
111 Gly Met Gly Glu Pro Glu Pro Glu Pro Ala Ala Glu Glu Ser His 
260 265 270 



201 



249 



297 



345 



393 



441 



489 



537 



585 



633 



681 



729 



777 



825 



873 
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65 70 
Lys Thr Val Asn Ala Glu Asp He Val Trp Ala Leu Asn Arg Leu Gly 



gac aag ccc gtc caa tot ggc tagtcgcgtg cgcggcgcgc gttagcttct 
Asp Lys Pro Val Gin Ser Gly 
275 280 

qcatcctgtg tactgtaata atttgccgtg tcgatccggc catggtttgt gtgtgcgtag 
SSSSt! atgtgggctt gtcctctagt aattcatgta "gcttatct -tgtggact 
tgtcctctag taattcatgt actctttgct gttgaaaaaa aaaaaaaaaa aaaa 

<210> 22 
<211> 280 
<212> PRT 

<213> Triticuiu aestivum 

Met Glu^n "p Gly Val Pro Asn Gly Pro Ala Ala Pro Ala Pro Thr 

1 5 10 " 

Gin Gly Thr Pro Val Val Arg Glu Gin Asp Arg Leu Met Pro lie Ala 

20 25 30 

Asn Val He Arg He Met Arg Arg Ala Leu Pro Ala His Ala Lys He 

35 40 45 

Ser Asp Asp Ala Lys Glu Ala lie Gin Glu Cys Val Ser Glu Phe He 

50 55 60 

Ser Phe Val Thr Gly Glu Ala Asn Glu Arg Cys Arg Met Gin His Arg 

Ala Leu Asn Arg Lei 

85 90* 95 

Phe Asp Asp Tyr Val Val Pro Leu Ser Val Phe Leu His Arg Met Arg 

100 105 11° 

Asp Pro Glu Ala Gly Thr Gly Gly Ala Ala Ala Gly Asp Ser Arg Ala 

115 120 125 

Val Thr Ser Ala Pro Pro Arg Ala Ala Pro Pro Val He His Ala Val 

130 135 I 40 

Pro Leu Gin Ala Gin Arg Pro Met Tyr Ala Pro Pro Ala Pro Leu Gin 
145 150 155 lbU 

Val Glu Asn Gin Met Gin Arg Pro Val Tyr Ala Pro Pro Ala Pro Val 

165 l* 70 x,:> 

Gin Val Gin Met Gin Arg Gly He Tyr Gly Pro Arg Ala Pro Val His 

180 I 85 190 

Gly Tyr Ala Val Gly Met Ala Pro Val Arg Ala Asn Val Gly Gly Gin 

195 200 205 

Tyr Gin Val Phe Gly Gly Glu Gly Val Met Ala Gin Gin Tyr Tyr Gly 

210 215 220 

Tyr Gly Tyr Glu Glu Gly Ala Tyr Gly Ala Gly Ser Ser Asn Gly Gly 
225 230 235 ^ u 

Ala Ala He Gly Asp Glu Glu Ser Ser Ser Asn Gly Val Pro Ala Pro 

245 250 2 " 

Gly Glu Gly Met Gly Glu Pro Glu Pro Glu Pro Ala Ala Glu Glu Ser 

His Asp Lys Pro Val Gin Ser Gly 
275 280 

<210> 23 
<211> 65 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> LEC1 consensus protein sequence 



924 



984 
1044 
1098 



<221> VARIANT 
<222> (1) ... (65) 
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<223> Xaa c Any Amino Acid 



Arg Glu Gin Asp Xaa Xaa Met Pro He Ala Asn Val H« Arg lie Met 

1 5 10 15 

Arg Xaa Xaa Leu Pro Xaa His Ala Lys lie Ser Asp Asp Ala Lys Glu 

20 25 30 

Xaa He Gin Glu Cys Val Ser Glu Tyr He Ser Phe Xaa Thr Xaa Glu 

35 40 45 

Ala Asn Xaa Arg Cys Xaa Xaa Xaa Xaa Arg Lys Thr Xaa Xaa Xaa Glu 
Rn 55 60 



Xaa 
65 
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inventions in this international application, as follows: 

1. Claims: 1-62 all partially 

An isolated nucleic acid comprising a member of the group 
consisting of those as embodied in claim 1 of the 
application, especially having the sequence set forth in SEQ 
ID NO: 1,13,15, being a polynucleotide having a plant 
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the protein being a LEC1 transcriptional activator. 
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Methods for positive selection of a transformed cell, for 
inducing apomixis, for increasing transformation efficiency, 
and for increasing recovery of regenerated plants comprising 
the use of an LEC1 polynucleotide as defined above. 
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idem for SEQ ID NO: 7,8, the sequences being derived from 
Argemone 



3. Claims: 1-62 all partially 

Idem for SEQ ID NO :9, 10,17-20, the sequences being derived 
from soybean 



4. Claims: 1-62 all partially 

idem for SEQ ID N0:ll,12, the sequences being derived from 
Veronia 



5. Claims: 1-62 all partially 

idem for SEQ ID N0:21,22, the sequences being derived from 
wheat 
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